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FOREWORD 


The  Third  Army  Conference  on  Applied  Mathematics  and  Computing  was  held  at  the 
Georgia  Institute  of  Technology,  Atlanta,  Georgia  on  13-16  May  1985.  The 
sponsor  of  these  annual  meetings  is  the  Army  Mathematics  Steering  Committee 
(AMSC).  Its  members  would  like  to  thank  Professor  William  F.  Aimes  for  his 
invitation  to  hold  the  conference  at  his  university,  and  for  his  outstanding 
work  as  the  Chairperson  on  Local  Arrangements.  He,  along  with  Professors  M. 

F.  Barnsley,  Albert  Bharucha-Reid ,  and  R.  W.  Schafer  organized  the  three 
special  Sessions  for  this  meeting.  Unfortunately  Professor  Bharucha-Reid 
passed  away  before  this  conference  was  held,  but  his  help  in  selecting  the 
speakers  and  other  planning  phases  contributed  a  great  deal  to  its  success. 

He  was  a  skillful  mathematician  whose  advice  and  council  was  often  sought,  and 
to  many  in  the  scientific  community  he  was  a  friend  and  so  as  time  gives  on  he 
will  be  missed  more  and  more. 


The  program  of  the  present  conference  consisted  of  three  parts,  namely: 

(a)  Seven  invited  addresses;  (b)  Three  special  sessions;  and  (c)  Contributed 
papers  by  Army,  academic  and  other  scientific  personnel  .  The  sixteen  speakers 
for  the  special  sessions  were  selected  by  their  organizers.  These  sessions 
carried  the  titles:  "Algorithmic  Issues  in  Multi-Dimensional  Digital  Signal 
Processing,"  "Applications  of  Chaotic  Dynamics,"  and  "Invariant  Solutions  of 
Partial  Differential  Equations."  The  subcommittee  of  the  AMSC  that  oversee 
these  conferences  was  very  pleased  with  the  high  scientific  quality  of  the 
fifty-one  contributed  papers.  The  Army  scientists  had  an  opportunity  to  hear 
and  talk  with  many  nationally  known  mathematical  scientists  during  the  course 
of  this  meeting.  Some  of  these  were  the  invited  speakers,  who  are  listed 
below  together  with  the  titles  of  their  addresses,  but  also  with  many  others 
that  appeared  on  the  program  or  were  members  of  the  audience. 


SPEAKERS  AND  AFFILIATION 

Professor  J.  T.  Schwartz 

Courant  Institute  of  Mathematical 

Sciences 


Professor  S.  Rosenblatt 
Illinois  Institute  of  Technology 

Dr.  V.  K.  Stokes 
General  Electric  Company 

Professor  0.  Strickwerda 
University  of  Wisconsin 

Professor  S.  N.  Atluri 
Georgia  Institute  of  Technology 


TITLES  OF  ADDRESS 

Identification  of  Partially  Obscured 
Objects  in  Two  and  Three  Dimensions  by 
Matching  of  Noisy  "Characteristic 
Curves" 

Bifurcation  and  Stability  of 
Viscoelastic  Fluid  Flows 

The  Role  of  Modelling  in  an  Industrial 
Environment 

Finite  Difference  Methods  for 
Elliptic  Systems 

Computational  Aspects  of  Finite  Strain 
Inelastic  Solid  and  Fracture  Mechanics 


iii 


Professor  J.  R.  Rice 
Purdue  University 


Using  Supercomputers:  Today  and 
Tomorrow 


Professor  M,  H.  Schultz  Parallel  Computing  and  Fluid  Dynamics 

Yale  University 

A  large  number  of  individuals  contributed  to  the  success  of  the  conference,  in 
particular,  the  many  speakers,  the  chairpersons,  the  host  personnel,  and  the 
active  and  enthusiastic  members  of  the  audience.  The  members  of  the  AMSC  were 
please  that  most  of  the  speakers  were  able  to  find  time  to  prepare  their 
papers  for  these  Transactions.  These  research  articles  will  enable  many 
persons  that  were  not  able  to  attend  the  symposium  to  profit  by  these  contribu¬ 
tions  to  the  scientific  literature. 
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Some  Remarks  on  Robot  Vision^ 

Jacob  T.  Schwartz  &  Micha  Sharir 
Courant  Institute  of  Mathematical  Sciences 
and 

School,  of  Mathematical  Sciences,  Tel  Aviv  University 
ABSTRACT 

This  paper  considers  the  problem  of  using  ‘depth*  images  of  portions  of 
3-D  objects,  drawn  from  a  finite  vocabulary  of  potential  objects  having 
known  geometry,  to  identify  the  objects  and  determine  their  orientation  when 
the  objects  rre  viewed  from  an  unknown  angle.  Several  techniques,  including 
a  simple  ‘probing’  method  which  can  be  used  when  at  lease  two  object 
features  constrain  its  orientation,  and  the  use  of  semi-local  invariant 
parameters  of  shape  are  suggested. 

A  robust  boundary-matching  method,  which  determines  best  partial 
least-squares  matchings  rapidly  is  then  described.  Finally,  techniques  for 
using  the  silhouette  of  a  polyhedral  body  to  determine  its  identify  and 
orientation  is  described. 

1.  Introduction 

The  goal  of  robotics  is  to  develop  general-purpose  mechanisms  having  ‘opera¬ 
tive*  intelligence,  i.e.  that  rudimentary  level  of  intelligence  which  is  displayed  in 
the  every-day  handling  of  objects  in  the  workplace  and  the  home.  For  this  level  of 
capability  to  be  realized,  a  robot  will  need  to  maintain  at  least  a  partial  model  of 
its  environment  internally.  Such  a  model  would  represent  the  (known  aspects  of 
its)  environment  in  symbolic  fashion,  as  a  collection  of  ‘objects*  having  known 
shape  and  orientation.  Rigid  objects  are  simplest;  however  a  complete  environ¬ 
ment  model  would  eventually  have  to  accommodate  flexible  objects  like  rope, 
paper  and  cloth,  liquids,  soft  objects  (e.g.  mashed  potatoes),  amorphous  objects 
like  dustpiles  or  heaps  of  crumbs,  etc.  Foregoing  these  interesting  but  more  diffi¬ 
cult  problems,  the  following  remarks  will  concentrate  on  the  relatively  simple 
class  of  rigid  objects  and  on  the  problem  of  identifying  such  objects  and  determin¬ 
ing  their  orientations  so  that  they  can  be  manipulated  by  a  robot.  Of  course, 
manipulation  also  requires  an  understanding  of  object  properties  and  inter-object 
relationships  such  as  centers  of  mass,  relationship  of  support,  and  coefficients  of 
friction,  all  of  which  are  concepts  which  a  capable  robot  will  have  to  understand. 
However,  we  ignore  all  these  issues  to  focus  on  the  underlying,  still  unsolved, 
problem  of  how  to  recognize  objects,  seen  from  unknown  orientations,  and  possi¬ 
bly  seen  as  parts  of  complex  multi-object  scenes. 

^Work  on  this  paper  has  been  suported  in  part  by  CXfice  of  Naval  Research  Grant  N00014' 
82-K-0381,  by  a  grant  from  the  US-Israel  Binational  ^ence  Foundation,  and  by  grants  from  the 
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Object  recognition  begins  with  raw  perceptions,  i.e.  pixel  arrays.  These 
must  be  analyzed  into  objects  which,  if  rigid,  can  be  identified  by  the  shapes  of 
their  bounding  surfaces  (but  may  also  have  properties  other  than  their  shape 
which  can  be  used  to  identify  and  locate  them,  including  color,  albedo,  acoustic 
reflectance,  magnetic  behavior,  visual  texture,  electrical  behavior;  indeed,  any 
property  that  a  sensor  can  detect).  To  analyze  the  objects  in  a  robot’s  environ¬ 
ment  will  be  more  or  less  difficult  depending  on  whether  the  objects  which  can 
appear  are  known  a  priori,  and  on  whether  observation  can  be  maintained  con¬ 
tinuously  or  only  applied  occasionally. 

(1)  In  a  highly  controlled  environment,  it  may  be  known  that  only  certain 
objects,  or  only  objects  belonging  to  known  classes,  whose  members  are  pre¬ 
cisely  characterized  by  a  small  number  of  parameters,  can  be  present.  If 
these  objects  are  known  to  change  position  only  when  the  robot  moves  them, 
and  if  none  of  the  robot’s  manipulations  miscarry,  it  may  be  possible  to  keep 
track  of  the  objects,  without  much  sensing,  by  a  kind  of  ‘dead  reckoning’. 
Even  if  some  manipulations  miscarry,  it  may  be  known  (or  plausibly 
assumed)  that  only  certain  of  the  objects  will  move  to  unknown  positions 
when  an  attempted  motion  fails.  It  may  then  be  possible  to  find  these  objects 
again  by  differencing  the  pre-manipulation  scene  and  its  miscarried  result, 
e.g.  after  dropping  coins  on  a  smooth  floor  one  can  locate  them  again  by 
searching  visually  for  shiny  raised  areas  on  the  floor. 

(2)  Even  if  some  of  the  objects  move  independently  of  the  robot  system,  it  may 
be  possible  to  maintain  a  valid  environment  model  by  keepnng  the  moving 
objects  under  continuous  observation.  In  that  case  the  robot  system  may  be 
able  to  foUow  the  position  of  aU  objects  at  all  times,  and  wUl,  e.g.,  retain  the 
capability  of  returning  them  to  their  base  positions  on  command.  For  exam¬ 
ple,  a  future  home  robot  system  with  multiple  eyes  built  into  the  walls  of  a 
house  might  be  able  to  keep  all  a  family’s  dishes  under  continuous  observa¬ 
tion  during  a  meal  or  party,  after  the  conclusion  of  which  it  could  return 
them  to  their  standard  positions  (after  cleaning). 

(3)  When  new  objects  can  appear  in  the  robot’s  environment,  they  will  at  first  be 
perceived  simply  as  unexpected  surfaces.  It  will  then  be  necessary  to  analyze 
these  surfaces  into  the  objects  for  which  they  belong.  This  process  may  be 
able  to  exploit  a  priori  knowledge  that  only  objects  of  certain  categories  will 
ordinarily  appear  in  the  robot’s  environment. 

To  be  fully  useful,  the  recognition  capabilities  described  in  the  preceding 
pages  need  to  be  organized  appropriately.  What  is  wanted  is  basically  a  pair  of 
procedures.  The  first  of  these  should  be  an  object  acquisition  procedure  to  which 
a  succession  of  objects  can  be  presented  and  their  identities  supplied.  The  acquisi¬ 
tion  procedure  should  record  the  object  shapes  in  some  suitably  compressed  form 
and  can  [tre-proccss  these  shapes  so  as  to  obtain  a  collection  of  efficient 
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disoiminating  tests  for  subsequent  object  identification.  The  second  procedure 
will  then  use  this  data  to  ingest  a  scene  containing  one  or  more  of  these  objects 
and  convert  it  into  a  list  of  the  objects  present,  each  with  its  identity  and  orienta¬ 
tion. 

Gomg  one  step  further,  we  can  describe  the  goal  of  vision-based  object 
recognidoir  system  as  follows.  The  system  should  maintain  and  continually  update 
a  set  of  ‘registers’,  one  for  each  object  observed;  each  of  these  registers  should  at 
all  times  contain  the  position  and  orientation  of  the  corresponding  object.  For 
moving  objects,  the  system  should  update  this  information  continuously.  This 
implies  that  after  initial  scene  analysis  the  system  must  frequently  probe  to  deter¬ 
mine  how  the  objects  in  the  scene  have  moved,  and  whether  new  objects  have 
entered  the  scene. 

The  values  present  in  such  continuously  updated  registers  can  be  considered 
to  represent  the  natural  outputs  of  an  advanced  robot  vision  capability  since  they 
are  just  what  the  system  needs  to  control  and  manipulate  its  environment.  Practi¬ 
cal  progress  in  scene  analysis  will  be  defined  by  the  classes  of  2-D  and  3-D  objects 
for  which  we  are  able  to  make  such  a  symbolic  interface  to  the  real  world  avail¬ 
able  and  reliable. 

2.  Advantageous  Forms  of  Raw  Data 

Visual  information  is  most  useful  if  it  is  given  as  3-D  visual  data,  i.e.  if  true 
3-D  coordinates  are  immediately  calculated  for  all  points  observed.  We  note  that 
devices  which  provide  such  3-D  data,  usually  based  on  laser-beam  scanning  or  on 
use  of  specially  ‘structured  light’  are  commercially  available  already;  sec  e.g. 
[S79],  [Sc83],  [T83]. 

The  crucial  advantage  of  3-D  vision  is  that  it  allows  images  to  be  acquired 
by  arbitrarily  many  eyes.  Whereas  to  take  ordinary  (2‘D)  images  acquired  by 
several  eyes  and  combine  them  is  not  easy,  multiple  3-D  images  of  a  single  scene 
combine  in  a  trivial  way,  since  they  all  refer  to  surfaces  in  a  common  geometric 
space.  This  makes  it  possible  to  use  arbitrarily  many  eyes,  some  fixed,  others 
mounted  on  moving  parts  of  the  robot  system.  (Eyes  need  to  be  mounted  on  the 
robot  itself  if  either  the  robot  can  roam  freely,  or  to  ensure  that  the  space  near 
moving  portions  of  the  robot  is  not  obscured,  either  by  an  intervening  object  or  by 
parts  of  the  robot  itself.  This  second  purpose  may  require  specialized  eyes  of 
appropriate  form  and  position.)  Note  that  an  ‘all  seeing’  eye  system  of  this  sophis¬ 
tication  subsumes  a  quite  satisfactory  proximity  sensor,  and  makes  other  forms  of 
proximity  sensors  superfluous. 

Of  course,  this  still  leaves  open  technical  questions  such  as:  how  to  combine 
separate  observations;  what  to  do  when  surfaces  seen  by  more  than  one  eye  differ 
discemably;  and  when  to  reject  an  interpretation  because  of  unacceptably  large 
discrepancies.  Nevertheless  3-D  images  arc  basically  favorable  for  combination, 
while  2-D  images  are  basically  much  less  favorable. 
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3.  Image  Interpretation  Techniques 

The  visual  data  gathered  by  a  3-D  sensor,  i.e.  ‘3-D’  or  ‘depth’  images,  can  be 
grouped  in  a  table  listing  all  points  in  3-space  which  lie  on  some  reflecting  surface 
of  one  of  the  objects  present  in  a  scene.  However,  since  aU  sensor-gathered  data 
is  partly  corrupted  by  noise,  acquisition  of  3-D  images  leads  at  once  to  the  prob¬ 
lem  of  bow  to  identify  objects,  given  slightly  noise-corrupted  images  (or,  in  other 
cases,  silhouettes)  of  them. 

Whether  depth  images  or  silhouettes  are  in  question,  we  shall  assume  that 
the  objects  present  in  the  scene  to  be  analyzed  are  drawn  from  some  known  col¬ 
lection  of  possible  objects  . This  assumption  makes  the  image  analysis 

problem  ‘objective’  rather  than  ‘psychological’;  one  just  wants  the  compruter  look¬ 
ing  at  a  scene  to  calculate  an  integer  or  a  finite  set  of  integers  that  tells  us  exactly 
which  of  a  known  list  of  possible  candidate  objects  it  sees,  and  from  what  angles 
it  sees  them.  However,  our  simplifying  assumption  still  leaves  us  free  to  consider 
any  one  of  a  scale  of  image  interpretation  problems  of  gradually  increasing  diffi¬ 
culty,  all  of  which  are  ‘objective’  in  the  sense  just  mentioned,  and  all  of  which 
would  contribute  robot  capabilities  of  practical  significance  if  solved: 

(1)  The  bodies  present  in  the  scene  can  be  wholly  visible  or  may  be  partially 
obscured. 

(2)  The  bodies  can  be  straight-sided  (polygonal  or  polyhedral),  or  curved. 

(3)  If  the  bodies  present  in  the  scene  are  polygonal  or  polyhedral,  they  may 
either  be  known  to  lie  in  some  constrained  orientation,  (e.g.  standing  on  an 
edge  or  face,  atop  a  flat  surface),  or  can  be  present  in  completely  arbitrary 
orientations. 

(4)  We  may  be  able  to  assume  that  the  scene  contains  just  one  object,  or  may 
need  to  deal  with  scenes  containing  multiple  objects,  which  may  have  either  a 
single  uniform  orientation,  or  many  different  orientations. 

(5)  Instead  of  fixed  objects,  we  may  need  to  deal  with  objects  which  are  known 

only  to  belong  to  one  of  a  finite  sequence  of  object  classes  Oj(j) . 

each  of  which  depends  on  one  or  more  shape  parameters  s  (e.g.  in  a  home 
robotics  application,  cylindrical  cans  of  various  heights  and  widths  may  be 
encountered). 

This  list  defines  a  family  of  problems  for  whose  solution  appropiriate  algo¬ 
rithmic  or  heuristic  approaches  arc  needed.  The  efficiency  of  the  approach 
selected  will  be  important,  especially  in  the  dynamic  case  where  the  system  needs 
to  keep  track  of  moving  objects  in  real  time. 

The  remarks  which  follow  will  describe  various  semi-algorithmic  heuristics  of 
gradually  increasing  complexity  which  can  be  used  to  handle  some  of  the  prob¬ 
lems  listed  above.  Some  of  these  approaches  have  been  simulated,  and  where  pos¬ 
sible  we  will  note  the  results  of  simple  numerical  experiments.  The  approach  pro¬ 
posed  is  related  to  one  explored  in  a  series  of  papers  by  Y.  Shirai  of  the  Tsukuba 
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Electotechnical  Laboratory  in  a  scries  of  papers;  sec  [OS751,  [OS79],  [S791, 
[SKOI83],  [SS71]. 

4.  Recognition  of  2-D  Objects 

Two  basic  approaches  to  recognition  of  2-D  objects  drawn  from  a  finite  col¬ 
lection  of  candidate  objects  can  be  proposed.  The  first  approach  applies  successive 
‘probes’  to  the  object,  gathering  sufficient  information  to  discriminate  it  from 
among  other  potential-  objects  (and  to  determine  its  orientation).  This  approach 
deals  particularly  easily  with  simple  polygonal  objects,  but  sidesteps  the  issue  of 
shape  description.  The  second  approach  associates  a  global  shape  descriptor  with 
each  object  viewed,  and  then  matches  this  descriptor  to  pre-calculated  similar 
descriptors  of  the  model  objects  expected  to  be  seen. 

Recognition  by  Probing 

By  processing  raw  image  data  in  simple  ways  we  can  apply  various  logical 
‘probes’  to  it.  (In  the  absence  of  visual  data,  probing  can  be  accomplished 
mechanically  by  detecting  object  contact  using  touch  sensors.)  Each  such  probe 
can  be  considered  to  move  along  some  specified  curve  y  from  a  gjven  position  in  a 
given  direction  until  the  first  intersection  of  y  with  an  object  present  in  the  scene 
is  detected.  The  curves  along  which  one  probes  can  be  straight  lines,  circles,  etc. 
If  only  silhouettes  of  an  object  are  gjven,  we  can  still  think  of  ‘silhouette  probes’, 
i.e.  probes  in  the  silhouette  plane  which  end  on  encountering  a  point  of  the 
silhouette  boundary. 
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Figure  1(c):  Silhouette  probe  of  a  3-D  object 
(Distance  from  camera  of  point  p  need  not  be  known.) 

To  see  how  easily  objects  can  sometimes  be  identified  by  probing,  consider 
the  simplest  2-D  case,  in  which  we  arc  given  a  polygon  standing  on  one  of  its 
edges.  In  this  particularly  elementary  situation,  the  first  probe  conducted  will 
establish  a  point  against  which  the  polygon  can  be  considered  to  ‘fit’,  and  then, 
knowing  the  point  at  which  the  probe  contacts  the  polygon,  we  know  that  the 
polygon’s  possible  positions  are  restricted  to  a  finite  set.  This  allows  us  to  build 
the  finite  set  of  points  at  which  a  further  probe  line  could  intersect  one  of  the 
polygons  which  might  confront  us,  in  one  of  its  finitely  many  possible  orienta¬ 
tions.  Suppose  we  divide  this  probe  line  into  minimal  resolvable  intervals  (deter¬ 
mined  by  the  precision  of  the  instrument  with  which  probes  are  conducted). 
Count  the  number  of  such  intervals  which  contain  points  of  intersection  and  cal¬ 
culate  the  entropy  of  the  associated  subdivision;  this  is  the  resolving  power  of  the 
probe  line.  For  efficiency  one  will  then  want  to  probe  along  the  line  whose  resolv¬ 
ing  power  is  greatest.  Only  a  subset  of  the  ori^nal  set  of  polygons  and  orienta¬ 
tions  will  remain  as  candidates  after  this  first  probe,  and  then  one  can  apply  a 
second  probe  which  has  greatest  resolving  power  for  this  subset,  etc.  The  tree¬ 
like  search  which  results  will  determine  the  identity  of  the  observed  polygon  and 
the  edge  that  it  is  standing  on.  Normally  very  few  probes  will  be  required.  The 
first  probe  should  be  at  a  level  which  minimizes  the  expjeaed  number  of  probes 
subsequently  required.  (This  style  of  searching  associates  a  notion  of  ‘entropy’, 
relative  to  the  imprecision  of  the  probing  instrument,  with  the  given  set 
Oj . c>4  of  objects;  this  ‘entropy’  is  likely  to  have  interesting  invariance  proper¬ 

ties,  and  deserves  closer  study.) 

Note  that  the  pirobing  procedure  outlined  is  independent  of  any  assumption 
of  convexity. 


7 


Ncjrt  consider  the  somewhat  less  trivial  case  of  a  convex  polygon  whose  ini¬ 
tial  orientation  is  totally  unknown  (but  assume  that  we  know  one  point  interior  to 
it).  Probe  the  polygon  twice  to  determine  two  points  on  its  periphery,  and  then 
track  the  segment  between  them  to  determine  whether  these  two  points  belong  to 
the  same  polygon  edge  (we  assume  that  such  a  'generalized  probing’  operation  is 
available).  If  not,  probe  at  a  point  intermediate  between  the  two  first  points,  and 
repeat.  Eventually  we  must  l^d  two  points  which  lie  together  on  the  same  edge 
of  the  polygon;  this  reduces  us  to  the  case  considered  previously,  since  the 
polygon  may  be  considered  to  be  ‘standing’  on  this  edge. 

Similar  ideas  can  be  applied  to  the  more  interesting  case  of  a  curved  2-D 
region.  To  avoid  complications  suppose  first  that  the  region  is  convex.  If  we  can 
locate  one  point  P  fixed  relative  to  the  region,  then,  by  probing  along  a  circle 
about  this  point  as  center,  we  can  orient  the  region  so  that  a  chord  through  the 
point  P  having  a  standard  length  D  can  be  regarded  as  horizontal.  This  standard¬ 
izes  the  position  of  the  region  to  one  of  a  finite  collection  of  possible  positions, 
and  then  we  can  use  the  kind  of  ‘probe  tree’  described  above  to  determine  which 
one  of  these  possible  orientations  it  has. 

If  the  whole  of  a  region  is  visible,  its  centroid  can  serve  as  such  an  anchor 
point.  Similar  use  could  also  be  made  of  the  two  most  distant  points  of  the 
region,  of  the  point  of  the  region  most  distant  from  the  line  coimecting  these  two 
points,  etc.  Polygon  comers  can  obviously  be  used  as  anchor  points;  acute 
comers,  of  which  only  a  few  can  exist,  are  obviously  preferable  to  obtuse  comers. 
It  is  only  necessary  that  any  point  which  probing  might  identify  with  a  particular 
anchor  point  P  should  belong  to  some  relatively  small  known  set  of  points  fixed 
relative  to  the  region,  but  when  the  anchor  point  is  ambiguous,  the  probe  tree 
which  we  build  up  must  reflect  all  the  possible  points  that  might  be  confused  with 
it. 

Next  consider  the  case  of  partially-obscured  2-D  objects.  The  preceding 
observations  suggest  that  the  first  step  in  recognizing  partially  obscured  objects 
should  be  to  defme  noise-immune  anchor  points  which  can  be  located  even  if  part 
of  the  region  is  obscured.  This  case  is  of  course  more  difficult  than  that  in  which 
the  whole  object  is  visible,  because 

(1)  For  totally  visible  objects,  obvious  ‘global’  anchor  points  such  as  the  object 
centroid  are  available,  while  for  partially  occluded  objects  anchor  points  must 
be  calculated  from  relatively  ‘local’  data. 

(2)  If  the  object  being  observed  is  nonconvex,  then  the  (boundary  of  the)  convex 
hull  of  the  portion  being  observed  need  not  lie  on  the  convex  hull  of  the 
whole  object  (sec  Fig.  11). 

Of  course,  if  the  observed  portion  of  the  object  has  sharp  discriminating 
features  such  as  acute  comers,  then  fmding  an  anchor  point  will  be  relatively 
easy.  If  no  such  sharp  features  are  present,  the  problem  becomes  more  difficult. 
One  way  of  approaching  it  is  by  associating  the  object  with  some  appropriate, 
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geometrically  defined  function  on  the  unit  drde/sphere  of  directions  in  2-D 
(resp.  3-D)  space.  The  function  must  be  one  whose  geometric  definition  makes  it 
invariant  with  respect  to  Euclidean  motions  of  the  region  to  be  analyzed.  When¬ 
ever  such  an  artificial  'color'  shows  sharp  transitions  or  peaks,  these  can  be  used 
to  defme  the  anchor  points  that  we  need.  In  effect,  this  notion  of  ‘artificial  color’ 
converts  the  shape  recognition  problem  into  the  problem  of  recognizing  ‘colored 
beachballs'  when  these  are  seen  from  an  unknown  orientation,  a  problem  for 
which  the  presence  of  spots  or  regions  of  sharply  defined  color  wUl  clearly  be  sig¬ 
nificant. 

Artificial  colors  of  the  type  proposed  can  be  defined  in  very  many  ways,  but 
we  want  to  chose  one  which  has  peaks  or  which  varies  sharply  in  the  vicinity  of 
geometrically  significant  boundary  features  of  the  body  to  be  analyzed.  One  pos¬ 
sible  scheme  is  as  follows:  take  a  modified  "carpenter’s  square"  MCS,  consisting 
of  two  half-lines  making  some  standard  angle  a  (  a  =  90°  would  be  the  standard 
carpenter’s  square)  and  fit  it  over  the  region  so  that  both  of  its  two  sides  touch 
the  boundary  of  the  region.  The  point  at  which  this  contact  occurs  is  determined 
by  the  orientation  6  of  (some  distinguished  one  of)  the  sides  of  MCS. 


Figure  2:  Modified  ‘carpenter’s  square’  in  contact  with  two  points 
of  a  body.  ‘Leading’  side  is  at  angle  6  to  the  horizontal. 
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Let  A  be  the  apex  of  the  modified  carpenter's  square  MCS.  Take  the  seg¬ 
ment  connecting  the  two  points  of  contact  between  the  region  and  MCS,  take  the 
midpoint  A  and  M  of  this  segment,  and  then  find  and  record  the  distance  d(e)  = 
d(0,a)  between  the  point  x  at  which  the  line  from  A  to  M  crosses  the  region 
boundary. 


of  this  segment  xA 


Figure  3:  Using  a  modified  carpenter’s  square  to  measure  a  region’s  aver¬ 
age  boundary  curvature. 

If  the  region  were  simply  a  circle  of  radius  R,  then  the  distance  rf(0)=<f(0,a) 
would  be  independent  of  0,  and  would  in  fact  be  /?(suia/2)  ^—1.  Thus  d(0)  meas¬ 
ures  a  kind  of  average  of  the  curvature  of  the  periphery  of  the  region;  averaged, 
that  is,  over  the  section  of  periphery  between  its  the  region’s  two  points  of  contact 
with  MCS. 

The  angle  a  that  can  be  used  in  measuring  the  periphery  of  a  partially 
obscured  region  depends  on  what  portion  of  the  periphery  is  visible.  To  use  an 
angle  a,  the  tangent  to  the  visible  portion  of  the  periphery  must  turn  through  an 
angle  exceeding  180  **  -  a.  The  closer  a  approaches  180®,  the  closer  (sinay2)~i-l 
comes  to  0,  and  hence  the  more  sensitive  d(0)  becomes  to  small  measurement 
inaccuracies. 

If  d(0)  is  constant  (for  several  values  of  the  apex  angle  a  of  our  modified 
carpenter’s  square  MCS),  then  the  region  (or  rather  the  visible  portion  of  its  peri¬ 
phery)  must  be  circular,  and  hence  actually  possesses  no  geometric  features  other 
than  its  radius.  If  d(0)  is  nearly  constant,  i.c.  if  the  ratio  of  its  largest  to  its  smal¬ 
lest  values  lies  near  1,  then  the  (visible  part  of  the)  region  will  be  nearly  circular, 
and  hence  relatively  featureless  geometrically.  Otherwise  this  ratio  will  vary  more 
substantially,  enabling  us  to  locate  anchor  points  relatively  sharply.  To  expand 
upon  this  remark,  it  is  convenient  to  consider  not  d(0)  but  its  logarithm  D(0)  ^ 


log  d(0).  By  assumption,  D(6)  varies  substantially  from  its  minimum  value  (over 
the  visible  part  of  the  periphery,  which  correspnds  to  a  range  of  angles  <=  2  ir). 
Suppose  that  the  smallest  change  in  D  that  we  feel  able  to  measure  is  a  change  €. 
Establish  a  succession  of  levels  6,6+€,8+2«,  ...  through  the  range  of  D.  For  each 
of  these  levels  8,,  divide  the  range  over  which  0  varies  into  disjoint  intervals,  each 
containing  a  point  at  which  D  takes  on  the  value  8,,  and  each  terminated  by  the 
first  occurrence  of  a  sufficiently  large  interval  in  which  D  dips  below  8^— t  or  rises 
above  8^+t.  Choose  one  representative  point  in  each  such  interval,  take  this  as  an 
anchor  point,  and  make  corresponding  entries  in  a  probe 'tree. 

To  identify  a  region  using  this  information,  we  can  subsequently  survey  it 
with  a  generalized  carpenter’s  square  of  appropriate  apex  angle  (depending  on  the 
amount  of  unobscured  periphery  available,  which  is  appropriately  measured  in 
terms  of  the  number  of  degrees  through  which  the  periphery  has  turned.)  Once 
having  measured  the  boundary  in  this  way,  find  intervals  as  above;  that  is,  inter¬ 
vals  each  of  which  contains  at  least  one  point  0  for  which  £>(0)=8^  and  terminated 
in  the  same  way  as  the  intervals  used  to  build  the  probe  tree.  Examine  these 
intervals  for  each  level  8,,  and  take  the  smallest;  this  gives  the  most  definite  infor¬ 
mation  concerning  the  location  of  the  corresponding  anchor  point.  Hien  divide 
this  interval  of  orientations  into  subintervals,  each  small  enough  so  that  no  point 
of  intersection  with  a  probe  line  can  move  by  more  than  the  standard  measure¬ 
ment  uncertainty  of  a  probe  when  the  object  turns  through  a  single  orientation 
step.  In  effect,  this  rule  defines  the  number  of  ‘micro-facets’  into  which  our  pro¬ 
cedure  must  divide  the  interval. 

Moving  through  this  range  of  orientations  by  stepping  successively  between 
the  intervals  into  which  we  have  divided  it,  take  the  point  x  of  Fig.  3,  which  lies 
between  the  point  M  and  the  apex  A  of  the  GCS,  as  an  anchor  point,  and  then 
execute  (or  simulate)  a  series  of  probes.  Tlus  will  eliminate  incorrect 
orientations/identifications,  normally  quite  rapidly,  and  leave  only  those  orienta¬ 
tions  consistent  with  the  available  data  oonoeming  the  visible  periphery. 

Note  that  the  number  of  orientations  over  which  we  need  to  search  serially 
will  be  roughly  proportional  to 


min  [1]  /  var  (D[I) 

where  I  designates  an  angular  subinterval  of  the  visible  range  of  tangent  angles 
(to  the  region  periphery),  |  I  |  is  the  size  of  this  subinterval,  and  D  |  I  desig¬ 
nates  the  restriction  of  ^e  function  D  to  the  subinterval  I.  Thus  favorable  cases 
are  those  in  which  a  substantial  part  of  the  variation  of  D  takes  place  in  some 
small  range  of  angles;  unfavorable  cases  are  those  in  which  D  varies  uniformly 
over  the  whole  of  the  visible  angular  range.  Even  in  this  unfavorable  case,  the 
range  of  angles  we  have  to  search  will  be  limited  to  a  fraction  of  the  total  angular 
range  inversely  proportional  to  D’s  total  variation. 
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The  following  additional  technique  can  be  used  to  improve  the  effidenc^  of 
the  simple  approach  just  outlined.  Suppose  that  the  function(s)  D  using  which  we 
are  trying  to  identify  and  orient  a  region  are  constant  or  nearly  constant  over 
some  substantial  portion  B  of  a  region  boundary.  Then  this  section  B  of  the 
boundary  is  likely  to  be  dose  to  circular,  and  of  a  known  radius  R.  We  can 
exploit  this  fact  by  mapping  the  visible  portion  of  the  boundary  to  a  much  smaller 
curve.  This  can  be  done  by  moving  each  of  its  points  P  a  known  distance  d  (easily 
calculated  from  the  estimated  radius  R)  perpendicularly  away  from  the  tangent 
line  at  p.  The  image  of  B  is  then  a  signiBcantly  smaller  curve  B’.  (The  image  of  a 
perfect  circle  would  plainly  be  the  unique  point  fixed  relative  to  the  drde,  i.e.  its 
center.) 


Figure  4:  Mapping  a  nearly  circular  curve  into  a  smaller  curve 
by  ‘radial’  translation  of  its  points. 

Once  B  has  been  constructed,  we  can  cover  it  with  suffidently  small  drdcs, 
which,  hopefully,  will  not  be  very  numerous.  The  centers  of  these  circles  form  a 
collection  A  of  possible  anchor  points,  in  the  sense  that  when  the  curve  is  meas¬ 
ured  (with  a  generalized  carpenter’s  square)  and  found  to  have  a  I>-value  which 
only  Umits  the  region  orientation  to  the  large  angular  interval  B,  any  point  con¬ 
structed  in  the  manner  just  explained  must  lie  very  close  to  one  of  the  anchor 
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points  in  A.) 

5.  Some  observations  concerning  geometrically  ^colored’  and  ^color- 
less*  curves  and  surfaces  in  2  and  3  dimensions 

The  issue  crucial  to  some  of  the  region  identification  techniques  outlined 
above  is  how  to  fmd  one  or  more  'anchor  points’  which  can  be  used  to  standard¬ 
ize  the  position  of  the  region.  (Similarly,  flat  sides  of  a  region  define  ‘anchor 
orientations’.)  Once  such  an  anchor  point  has  been  found,  the  identification 
problem  becomes  very  much  easier.  An  anchor  point  may  be  unique,  or,  as  in 
the  case  of  a  polygon,  many  possible  points  (vertices)  may  define  useful  anchors. 
Moreover,  anchor  points  may  be  uniquely  identified  by  geometric  invariants  asso¬ 
ciated  with  them,  or,  as  in  the  case  of  a  regular  polygon,  a  re^on  may  possess 
symmetries  and  therefore  possess  multiple  anchor  points  which  fall  into  logically 
indistinguishable  categories. 

As  we  have  noted,  as  soon  as  a  curve  is  ‘painted’  with  some  concrete  or 
abstract  ‘color’  which  has  significant  variation  along  the  curve,  it  becomes  is  easy 
to  defme  anchor  points;  it  suffices  to  take  those  points  having  some  characteristic 
color,  (but  for  this  we  want  to  pick  a  color  which  occurs  only  infrequently  on  the 
curve.)  The  rotational  invariants  occuring  in  the  preceding  discussion  give  us  a 
way  of  operating  in  situations  in  which  no  external  color  is  available,  by  forming 
noise-immune  geometric  invariants  and  using  them  as  generalized  colors.  (These 
‘geometric  colors’  are  most  naturally  associated  with  orientations  of  a  measuring 
instrument  and  thus  ean  most  naturally  be  regarded  as  painting  the  circle  rather 
than  the  region  boundary  under  investigation.  Since,  in  the  case  of  convex 
bodies,  each  orientation  maps  naturally  to  a  point  of  the  repon  boundary,  this 
viewpoint  loses  no  significant  information.) 

We  can  best  understand  the  potential  of  this  approach  by  considering  those 
situations  in  which  it  must  fail.  These  are  situations  in  which  the  boundary  curve 
being  measured  is  completely  ‘colorless’  relative  to  the  geometric  invariant  calcu¬ 
lated,  i.c.  cases  in  which  the  battery  of  invariants  we  bring  to  bear  have  constant 
values  over  the  boundary  of  the  object  being  measured.  Note  that  these  are  also 
cases  other  shape  matching  techniques  will  also  tend  to  fail,  because  the  same 
degree  of  matching  will  be  attained  by  a  large  family  of  orientations  differing  sim¬ 
ply  by  Euclidean  motions,  making  it  impossible  to  discriminate  between  these 
orientations. 

To  be  satisfied  with  a  collection  of  geometric  invariants,  we  will  want  con¬ 
stancy  of  the  geometric  invariants  used  to  ‘paint*  an  object’s  boundary  to  imply 
that  the  boundary  is  inherently  colorless  geometrically,  i.e.  to  imply  that  its  points 
are  equivalent  to  each  other  under  a  Euclidean  motion  of  the  whole  plane. 
Curves  having  this  property  must  dearly  be  orbits  of  points  under  l-parametcr 
subgroups  of  the  group  of  plane  motions,  and  hence  must  cither  be  drdes  or 
straight  lines  (note  therefore  that  if  we  can  sec  the  whole  boundary  of  an  object, 
the  circle  is  the  only  possible  colorless  curve).  Let  us  call  a  set  of  geometric 
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invariants  ample  if  any  curve  for  which  these  invariants  are  constant  over  the 
length  of  a  curve  is  necessarily  straight  or  circular.  (In  addition,  we  want  invari¬ 
ants  that  are  stable  relative  to  small  perturbations  of  a  curve,  and  which  are  local, 
allowing  them  to  be  calculated  for  nearly  the  full  angular  range  through  which  a 
partially  obscured  convex  curve  turns.)  Once  we  have  an  ample  set  of  invariants 
(also  possessing  the  other  properties  just  noted)  we  will  have  done  as  well  as  we 
can,  in  the  sense  that  invariants  better  in  any  ideal  sense  are  impossible.  Similar 
considerations  apply  to  curves  in  3-space  and  to  curves  which  lie  in  other 
geometric  objects  of  concern  to  us,  particularly  curves  on  the  sphere. 

A  similar  of  geometric  ‘colorlessness’  applies  to  curves  in  3-space  and  to  sur¬ 
faces.  A  geometrically  colorless  curve  in  3-spacc  is  either  a  straight  line,  circle,  or 
helix.  A  colorless  curve  lying  on  the  surface  of  the  sphere  is  necessarily  a  circle 
(not  necessarily  a  great  circle),  A  similar  notion  and  remark  apply  to  colorings  of 
the  sphere;  such  a  coloring  fixes  a  point  (which  can  then  be  used  as  an  anchor 
point)  unless  there  exists  a  continuous  group  of  rotations  of  the  sphere  which 
leaves  the  coloring  invariant,  i.e.  c(Rp)  =  c(p)  for  the  color  (or  colors)  c  and  every 
R  in  some  continuous  group  of  rotations.  Here  there  are  only  two  possibilities: 
cither  c  is  constant,  or  c  is  constant  on  each  of  a  family  of  parallel  circles  on  the 
sphere.  In  all  other  cases,  cither  changes  in  the  shape  of  one  of  the  level  curves 
c(p)  =  const  will  fix  a  point,  or  changes  in  the  relative  position  of  two  level  curves 
fix  such  a  point.  (For  example,  for  each  point  p  on  a  first  (drcular)  level  curve 
c(p)  -  const^  we  can  take  its  minimum  distance  to  a  second  (also  drcular)  level 
curve  c(p)  -  const2‘,  unless  the  two  curves  are  parallel,  this  function  paints  a  vary¬ 
ing  geometric  ‘color’  along  the  first  curve,  and  (assuming  infinite  preosion)  this 
color  fixes  an  anchor  point. 

Next  consider  surfaces  in  three  dimensional  space  which  are  geometrically 
colorless,  either  in  the  strong  sense  that  all  their  points  are  geometrically 
equivalent,  or  in  the  weaker  sense  that  the  surface  is  invariant  under  some  one¬ 
dimensional  continuous  subgroup  of  the  Euclidean  group.  In  the  first  case,  the 
surface  must  have  constant  prindpal  curvatures,  and  hence  must  be  a  portion 
either  of  a  plane,  sphere,  or  circular  cylinder.  In  the  second  case,  the  orbit  of 
any  point  under  the  group  of  motions  leaving  the  surface  invariant  must  be  either 
a  straight  line,  circle,  or  helix  (of  pitch  determined  by  the  group  leaving  the  sur¬ 
face  invariant).  Hence  the  surface  must  be  either  a  portion  of  a  cylinder  (not 
necessarily  circular),  a  surface  of  rotation,  or  a  ‘helical  cylinder’  (screw  surface) 
defmed  by  its  cross-section  in  a  plane  perpendicular  to  the  direction  of  the  com¬ 
mon  helix  axis. 
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Figure  5;  Example  of  a  ‘helical  cylinder’. 

All  points  of  each  helix  on  the  surface  have  equal  color. 


While  interesting  as  mathematical  examples,  helical  cylinders  of  this  kind  are 
(except  for  machine  screws  and  their  colored  equivalent,  barber  poles)  rare. 

6.  Shape  Descriptor  Matching 

Probing  methods  like  that  described  above  use  the  actual  periphery  of  the 
region  to  be  identified,  and  do  not  embody  any  global  concept  of  region  shape. 
This  contrasts  with  other  identification  techniques  that  work  from  some  abbrevi¬ 
ated  shape  descriptor  which  can  be  associated  with  the  periphery  of  a  convex 
region,  rather  than  from  the  periphery  itself.  The  stability  and  efficiency  with 
which  these  descriptors  can  be  matched  is  crucial  for  such  methods.  A  few 
mathematical  observations  can  be  made  concerning  this  fxjint.  Assume  first  that 
the  observed  region  is  expected  to  be  convex.  Such  regions  are  often  described  by 
their  ‘turning  function’  e(j),  i.c.  the  function  which,  starting  from  some  arbi¬ 
trarily  designated  point  of  its  periphery  and  proceeding  counterclockwise  around 
the  periphery,  records  the  change  in  angle  of  the  counterclockwise  tangent  as  a 
function  of  the  arc-length  s  traversed.  This  function  is  monotone  increasing,  and 
varies  through  2it  as  s  goes  from  0  to  its  final  value  S,  which  is  the  total  periphery 
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of  the  region.  The  value  S  simply  describes  the  total  size  of  the  region,  and  (if 
the  whole  periphery  is  available)  we  can  normalize  it  to  Itt,  so  that  0(j)  is  mono¬ 
tone  and  goes  from  (0,0)  to  (2'iT,2ir). 

The  function  0(j)  has  various  useful  properties: 

(1)  0(j)  is  invariant  under  any  Euclidean  motion  of  the  object  o  in  question. 

(2)  0(j)  depends  in  a  very  simple  way  on  the  starting  point  on  the  boundary  of  O, 

that  is,  if  the  starting  •  point  shifts  by  jq*  ®  undergoes  a 

correponding  horizontal  and  vertical  shift,  i.e.  simply  changes  to 

0'(j)  =  0(J  +  5o)  -  0(^o) 

We  can  still  use  the  shape  descriptor  0(j),  measured  along  the  visible  portion 
of  O’s  boundary,  even  if  O  is  partially  occluded,  i.e.  even  if  only  the  portion 
of  0  which  lies  right  of  some  (known)  directed  line  is  visible.  In  such  case 
the  graph  of  0  will  simply  be  an  (appropriately  shifted)  portion  of  the  graph 
for  the  whole  boundary  of  0. 

(3)  0  is  parametrized  by  the  arc  length  of  the  boundary  of  O,  which  can  become 
unstable  under  small  perturbations  if  convexity  is  lost.  That  is,  if  we 
represent  the  noise-corrupted  boundary  of  an  observed  object  0  sunply  as  the 
polygonal  line  passing  through  all  observed  boundary  points,  the  resulting  arc 
length  can  differ  greatly  from  that  of  the  ideal,  noise-free  object,  in  which 
case  the  shape  descriptors  for  the  observed  body  and  for  its  model  counter¬ 
part  will  not  approximate  one  another.  To  overcome  this  difficulty,  we  can 
compute  the  convex  hull  of  the  observed  data  points,  obtaining  a  convexified 
observed  object,  and  then  match  the  shape  descriptor  for  this  convexified 
observation  to  pre-stored  data  describing  various  model  convex  bodies. 

If  the  region  is  polygonal,  the  function  6(j)  is  a  step  function  whose  discon¬ 
tinuities  tend  to  be  troublesome  when  a  slightly  perturbed  measurement  of  6(s)  is 
matched  against  a  pre-stored  model.  To  avoid  tWs  problem,  one  can  simply  turn 
the  graph  of  the  function  45“  clockwise,  thereby  converting  it  into  the  graph  of  a 
revised  function  -^(1)  whose  derivative  is  bounded  by  1  in  absolute  value. 
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Figure  6(a):  Graph  of  6(j)  for  a  polygon. 


Figure  6(b):  Graph  of  0(5)  after  rotation  by  45“. 
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To  identify  an  observed  convex  region  0  (which  can  be  partially  obscured), 
we  can  then  compute  its  descriptor  e(«)  and  try  to  match  it  to  similar  shape 

descriptors  computed  for  model  regions  . corresponding  to  the  various 

2-D  objects  whidi  might  appear  in  an  observed  scene. 

In  situations  of  this  kind,  matching  is  customarily  implemented  by  means  of 
the  fast  Fourier  transform  algorithm.  More  specifically,  let  denote  the  graph 
of  boundary  turning-angle  vs.  arc  length  computed  for  a  (convexified)  observed 
polygon  O  after  this  graph  is' turned  clodcwise  by  45“  (if  O  is  obscured  by  a  line  1 
then  the  graph  of  €  should  start  at  one  point  of  obscuration  (i.e.  one  intersection 
of  1  with  the  boundary  of  o)  and  end  at  the  other  such  point).  For  each  of  the 
specified  model  objects  Oj  let  Tiy  denote  the  corresponding  (rotated)  graph  for  Oj. 

If  we  can  use  the  metric  in  shape  descriptor  space,  the  object  Oj  matching 
O  most  closely  is  that  which  minimizes  the  distance 

L 

(2)  nm  /h/(^+4o)  -  -  e(jc)F4* 

Actually,  we  find  it  better  to  change  (2)  slightly  so  as  to  find  the  best  "vertical  fit" 
between  r\j(x+d^  and  {(*),  i.e.  to  minimize 

i 

(3)  min  min  /h/(jr+4o)  - 

^  0  ■' 

In  (3)  the  best  value  of  c  is  given  by 

L 

(4)  c  =  c(d^  =  -p  -  ^(x))dx  . 

With  this  value  of  c,  (3)  becomes 

'l 

(5)  min  /h/^+^o)  -  5Wp4t  -  L|c(4q)P  = 

<*0  0 

L 

/  h/Ct)p4jf  +  /It(*)p4r  -  2/'r|,(x+<fo)€(*)dx  -  I,|c(<fo)P  , 

1  «  J 

where  in  the  third  integral  in  the  last  form  of  (5)  we  take  ^  to  be  defined  as  zero 
outside  the  interval  [0,L].  This  allows  the  minimum  appearing  in  (5)  to  be  rewrit¬ 
ten  as 

L 

(6)  min  (  //4o  +  "  ^y(^o)  "  i  I 

^  0 

L 

+  /I5(-')P4*  -  ljy\j(x+d^^(x)dx  )  , 

0 


where 
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(7) 


//(‘O  == 

^  0 


«y(rf)  =  S^y\j(x)dx . 

Since  the  most  expensive  part  of  the  computation  (6)  is  simply  a  convolution,  it 
follows  that,  after  discretizatioD  to  n  interpolating  points,  we  can  calculate  the 
miniTTiiim  (6)  (foT  cach  j  Separately)  in  time  0(n  log  n),  usmg  the  fast  Fourier 
transform  technique. 


Direct  Match  of  Rotated  2-D  Objects 

A  cruder  but  stabler  and  still  quite  effective  2-D  shape  matching  scheme  can 
also  be  implemented  efficiently  using  the  fast  Fourier  transform.  In  this  method, 
we  simply  take  a  sequence  of  points  equally  spaced  along  the  perimeter  of  a  (con- 

vexified)  observed  polygon  O.  More  precisely,  we  take  a  sequence  (uj . «„)  of 

points  in  clockwise  order  along  the  boundary  of  the  convex  hull  of  0  such  that  all 
the  arcs  between  successive  points  Uf  and  have  equal  lengths  (which  must 
therefore  be  S/n,  where  5  is  the  total  length  of  the  periphery  of  o).  We  then  wish 
to  match  two  such  sequences  and  (▼/)"-!  corresponding  to  an  observed 

(convexified)  object  0  and  a  model  object  M  respectively.  Assume  first  that  the 
whole  boundary  of  0  is  visible.  Matching  amounts  to  finding  a  Euclidean  motion 
E  of  the  plane  which  will  minimize  the  L2  distance  between  the  sequences 
and  i-e.  we  need  to  compute 

n 


To  simplify  this  calculation,  first  translate  O  so  that  its  centroid  lies  at  the  origin, 
giving 

i»y  =  0 

/-I 

Next  write  £  as  En  =  i?gO  +  a,  denoting  a  oounterdockwise  rotation  by  e.  Then 

A  =  rain  21^8®/  ~  v,p  = 


min 

e.a 


n 

b-i 


2  |v,p  +  n|ap  -  22  a-Vy  +  2  loyP  +  2^  -  2^ 


y-i 


y-i 


^0“y' 


But 

2  ^  ■•^e(2®y)  =  0  • 

Hence  a  and  e  appear  independently  in  A  and  we  can  minimize  their  contributions 
separately. 
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To  minimize  over  a  simply  put 


1  " 
n  y.i 


As  to  e,  we  need  to  compute 


8  =  raa*  2 


Regarding  the  vectors  Uy,  ?y  as  complex  numbers  uj,  vj,  we  can  rewrite  this  as 


8  =  max  Re 
e 


l;-i 


'^Yy 


=  IS  “/yl 


y-1 


Altogether  this  gives 


A  =  2  -  7I2  VyP  +  2  l";F  -  212  -yivl 

;-i  ^  « y-1  y-1  y-1 

A  =  2  I’yP  -  ^12  VyP  +  2  layp  -  2(1  2  "/»y  P  +  I  2  »yX'y  p]  '  . 


(•) 


where  dxv  denotes  the  (2-dimcnsional)  cross  product  of  the  vectors  o  and  v. 
(Note  the  similarity  between  (*)  and  the  formula  for  the  best  matching  between 
two  turning-angle  shape  descriptors  given  in  the  preceding  section.) 

If  O  is  partially  occluded  or  appears  in  an  unknown  orientation,  we  have  to 
match  the  sequence  (ay)J'.i  to  each  of  the  contiguous  subsequences  of  the 

(circular)  sequence  (vy)^p  for  d  ~  0,  .  .  .  ,m—l.  (We  assume  that  m  ^  n,  for  other¬ 
wise  the  (partial)  periphery  of  O  is  too  long  to  match  M.) 


For  each  such  d  (*)  becomes 

A(<0  =  2  kyp  “"“IS  »yp  2  l"yp  “  ^12  ^j''j+d\ 

j-d+i  "  y-d+i  y-1  y“i 

As  in  the  preceding  analysis,  the  niinimiim  of  the  values  A(J),  d  =  0,  .. .  ,m—l,  can 
be  found  in  time  o(m  log  m),  using  the  fast  Fourier  transform. 

It  is  interesting  to  note  that  the  observations  made  in  the  last  few  pages  gen¬ 
eralize  easily  to  curves  in  three  dimensions,  or,  more  generally,  to  any  situation 
in  which  a  model  curve  or  surface  o(<i))  depending  on  one  or  more  parameters  u 
must  be  rotated  and  translated  to  match  a  model  curve  or  surface  v(u»)  as  well  as 
possible.  We  need  to  assume,  however,  that  the  matching  operation  involves  no 
change  in  parametrization  for  either  of  the  functions  o(<i))  or  ▼(<•)). 

Suppose  more  specifically  that  we  are  given  two  descriptor  functions  o(o)), 
v(w),  tatS,  corresponding  respectively  to  an  observed  object  O  and  a  model  object 
M.  We  need  to  find  the  Eudidean  motion  E  (c.g.  of  3-spaoe)  which  minimizes 
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As  in  the  2-D  case,  we  translate  0  so  that  its  centroid  lies  at  the  origin,  giving 

J  n(ci»)  J(i)  =  0 
5 

Write  E  as  Ea  =  Ru  +  B,  where  R  is  a  rotation.  Then 

'  A  =  min  f  |/ta(a))  +  ■  —  v((«))P  dm  — 

■  s 

min  f  lv(w)P  dm  +  |5||ap  -  2  /  a-vCa))  dm  +  J  |o(a))p  dm  +  2f  B  Ra(m)  dm  -  2  f  Ru(m)  v(m)  dm 

[  5  s  S  s  s 

But 

J*  B'/ta((>))  dm  =  b  R(J  u(a))  rfo)  )  =  0  . 

5  s 

Hence  a  and  R  appear  independently  in  A  and  we  can  minimize  their  contributions 
separately. 

To  minimize  over  a  simply  put 

As  to  /?,  we  need  to  compute 

8  =  max  /  /?u((i))-v(a))  dm 
^  s 

To  find  8,  first  calculate  the  matrix  A  given  by 

Afj  =  /  n.(u)Vy((i>)  dm  , 

S 

(where  ij  =  1,2,3  if  we  are  dealing  with  a  curve  or  surface  in  3-space).  In  terms  of 
the  matrix  A  we  can  express  8  as 

8  *  max  tr(RA) 

R 

To  maximize  triRA),  decompose  A  as  A  =  Qfi,  where  Q  =  A(A*i4)  ^  is  a  pure  rota- 
tion  and  //  =  (A*A)2  is  positive  definite  symmetric.  This  gives 

Jl 

8  =  max  tr(RQE)  =  max  tr(RH)  =  tr(If)  =  /r((A*A)^) 

R  R 


To  sec  this,  note  that  since  the  trace  is  invariant  under  rotation,  we  can  assume 
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that  H  is  diagonal.  But  for  a  diagonal  positive  definite  matrix  (X,)  and  a  rotation 
matrix  (r^)  the  trace  of  the  product  2  can  be  no  larger  than  2  and  can 
assume  this  value  only  when  r,y  “  8^^. 

Overall,  we  have 

A  =  f  |v(a))p  dta  — rlrl/  ^(w)  </ti)P  +  /  |a(“)P  -  2  tr((A*A)2)  (*  ) 

s  Pl  5  S 


showing  that  the  optimal  rotated  match  between  0  and  M  can  be  found  in  time 
proportional  to  that  needed  to  integrate  the  various  functions  appearing  in  (••), 
i.e.  proportional  to  the  number  of  data  points  used  to  discretize  the  curves  or  sur¬ 
faces  u  and  V. 

Much  as  in  the  2-D  case  considered  previously,  these  formulae  can  be  used 
to  match  observed  3-D  curves  parametrized  by  arc  length  to  similarly 
parametrized  model  curves.  Matching  can  be  achieved  in  time  0(n  log  «)  by  using 
the  fast  Fourier  transform,  even  if  the  observed  curve  O  is  partially  obscured. 
This  remark  is  potentially  applicable  to  matching  of  ‘iso-color*  curves  on  3- 
dimensional  surfaces. 

There  are  two  difficulties  in  extending  the  matching  technique  just  described 
to  partially  obscured  surfaces.  The  first  difficulty  is  to  parametrize  O.  This  point 
is  discussed  belowj  but  the  obvious  parametrization  using  the  centroid  that  can  be 
used  in  the  unobscured  case  is  not  available  for  partially  obscured  objects. 

A  second  difficulty  involves  the  computational  cost  of  matching.  Suppose 
that  the  centroid  of  o  (or  some  other  anchor  point  common  to  both  o  and  A/)  is 
not  known  because  O  is  partly  obscured.  Then  we  have  to  match  the  visible  por¬ 
tion  of  O’s  surface  against  all  possible  similar  portions  of  A/’s  surface,  and  that 
may  force  us  to  iterate  over  (an  appropriate  discretization  of)  the  3-D  rotational 
group  Ry  This  means  that  if  we  discretize  ^3  into  points,  and  for  purpose  of 

integration  discretize  S  into  points,  we  end  up  with  an  0(n^)  matching  pro¬ 
cedure,  far  too  slow  to  be  useful.  What  is  missing  here  is  an  appropriate  generali¬ 
zation  of  the  discrete  fast  Fourier  transform  algorithm  to  the  case  of  (a  discre¬ 
tized  form  oO  the  group  Oy  Even  so  the  complexity  can  be  reduced  to  Ofn^log  n) 
by  using  the  standard  fast  Fourier  transform  to  handle  all  n  members  of  which 
transform  the  north  pole  of  S  to  the  same  point  on  S,  all  simultaneously.  To  do 
better  than  this,  a  generalization  of  the  fast  Fourier  transform  which  can  give 
some  rapid  way  of  evaluating  integrals  on  the  sphere  is  needed. 
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7.  Numerical  Experiments 

The  plausibility  of  the  2-D  matching  schemes  suggested  above  can  be  con¬ 
firmed  by  simple  numerical  simulations.  For  such  simulations  we  begin  by  gen¬ 
erating  noisy  star-shaped  objects,  representing  hypothetical  ‘measurements’. 
These  are  generated  by  taking  ideal  convex  objects,  and  perturbing  some  specified 
number  of  points  on  each  of  their  sides  by  adding  artificial  randomized  noise  to 
their  distance  from  the  object  centroid;  this  noise  ranges  between  1  -  noise^const 
and  1  +  noise_constf  where  noue^const  is  a  specifiable  parameter  controlling  the 
amount  of  noise  applied.  A  line  of  obscuration  can  also  be  specified  for  each 
simulation  run,  in  which  case  all  points  of  the  polygon  lying  to  the  right  of  this 
line  to  be  omitted  from  the  simulated  measurement.  Each  generated  object  0  is 
then  matched  against  a  collection  of  ideal  convex  objects,  including  the  one  from 
which  0  has  been  generated  by  appl5nng  the  above  random  perturbation. 

Each  of  the  two  preceding  heuristic  matching  schemes  has  been  simulated. 
When  the  matching  algorithm  finds  that  two  or  more  ideal  objects  have  nearly  the 
same  shape-descriptor  distance  from  a  shape-descriptor  it  reports  the  error  or 
ambiguity  in  specific  terms. 

Easy  simulations  have  been  run  with  the  library  of  convex  objects  shown  in 
the  following  figure.  The  results  of  these  simple  experiments  arc  encouraging. 
Both  matching  schemes  described  identify  the  correct  model  object  in  almost  all 
trials.  Exceptions  occur  in  cases  when  an  observed  object  is  obscured  in  a  way 
which  made  its  visible  portion  similar  to  a  portion  of  another  model  object,  or 
when  the  degree  of  random  noise  was  hi^  enough  to  confuse  the  measured 
object  with  a  visually  similar  but  different  model  object  (e.g.  a  circle  measured 
with  20  percent  noise  may  be  identified  as  an  oval).  The  two  matching  heuristics 
yield  similar  results,  but  in  the  presence  of  large  (quantities  of  noise  the  second 
technique  more  reliably  avoids  the  grotesque  misidentifications  that  begin  to 
plague  the  first  method.  The  following  figures  are  generated  using  the  second 
matching  scheme.  Note  that  in  each  case  the  matching  operation  successfully 
identifies  the  figure  presented  to  it,  from  among  all  the  other  figures  belonging 
to  the  small  shape  library  shown  in  Figure  7,  using  only  those  unobscured  boun¬ 
dary  portions  indicated  in  Figures  8(c)  (resp.  8(f)). ^ 


2  The  authon  would  like  to  thank  Charles  Kim  for  assistance  with  the  simulations  described 
in  this  section. 
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Figure  8(a):  A  test  oval  and  its  roughened  form 


Figure  8(b):  Adjusted  convex  hull  of  test  oval 


8.  Additional  remarks  on  the  turning-angle  shape  descriptor;  some 
remarks  on  texture;  non-convex  regions 

Since  the  derivative  of  the  rotated  graph  11(5)  derived  from  a  region’s  ‘turning 
function’  0(j)  is  bounded  by  1  in  modulus,  the  Fourier  scries  of  ti(j)  converges  to 
■n(j)  with  relative  rapidity,  making  it  possible  to  use  the  first  few  terms  of  this 
series  as  descriptors  of  the  overall  shape  of  the  region.  (The  adequacy  of  such  an 
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abbreviated  description  can  be  assessed  by  regenerating  the  figure  from  these 
Fourier  coefficients,  and  then  noting  what  differences  with  the  original  region  the 
eye  picks  out.) 

To  judge  the  limitations  of  this  abbreviation,  polygon  is  regular,  then  the 
function  e(«)  moves  in  alternate  horizontal  and  vertical  steps  which  must  be  equal 
in  size  (assuming  normalization  of  the  total  arc  length  to  21:).  Hence  -hCj)  is  as 
shown  in  the  following  figure: 


Figure  8(d):  Complete  match  of  oval  to  visible  convex  hull  section 
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Figure  9:  Hic  function  ii(x)  for  an  n-sided  regular  polygon 

A  (unit)  circle  would  have  a  perfectly  flat  ii(j)  function.  Tliis  comparison 
shows  that  the  Ti(j)-function  for  an  n-sided  polygon  approximates  that  for  a  circle 
very  closely  in  the  uniform  norm,  but  that  it  has  a  significantly  different 
geometric  texture.  To  reflect  this  fact,  a  shape-matching  scheme  would  have  to 
apjply  some  operator  which  will  detect  texture.  The  following  is  one  possibiity; 
decompose  the  function  y\  into  parts  corresponding  to  different  frequency  ranges 
by  applying  a  disjoint  set  of  band-pass  filters,  (These  can  decompose  -n  into  its 
low  frequency  part,  encompassing  all  frequencies  up  to  a  limit  F^,  and  into  then 
exponentially  expanding  frequency  ranges  to  ^2  gives 

’l(^)  “  ^i(*)  +  tijCj)  +  T|3(x)  +  •  •  • ,  where  relatively  few  terms  need  appear.  The 
low-frequency  component  can  be  represented  exactly  by  a  few  Fourier  coeffi¬ 
cients,  after  which  each  of  the  few  hi^er-frequency  components  1)2, 113,  ...  can  be 
handled  as  follows:  Calculate  the  variation  of  each  such  •r\j  over  the  range  from  0 
to  s.  This  defines  a  monotone  increasing  function  8^;  treat  this  as  previously,  i.e. 
turn  it  45®  and  represent  it  by  its  lowest  few  Fourier  terms.  The  functions  8y  then 
represent  the  way  that  the  texture  of  the  original  turning  function  0(j)  varies  from 
zone  to  zone  along  the  boundary  of  the  region  being  analyzed.  An  approach  like 
this  might  be  able  to  represent  the  shape  and  texture  of  a  convex  body  adequately 
using  something  like  25-35  numerical  parameters:  c.g.  4  sines  and  5  cosines  to 
represent  each  of  and  ■n2>  2  sines  and  3  cosines  for  TI3,  which  will  normally 

be  much  less  significant  to  the  eye. 
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As  in  the  simpler  case  noted  above,  we  can  assess  the  adequacy  of  these 
descriptors  by  generating  and  inspecting  the  simplest  curves  which  these  descrip¬ 
tors  fail  to  distinguish  within  a  variety  of  examples  presented  to  the  analysis  sys¬ 
tem.  One  obvious  shortcoming  is  that  the  proposed  sdieme  misses  periodicities 
which  the  eye  can  pick  out,  e.g.  it  does  not  distinguish  the  function  appearing 
in  the  last  preceding  figure  from  the  function  which  is  different  to  the  eye. 


Figure  10:  The  function  -hCj)  for  an  />-sided  but  not  entirely 
regular  polygon 


This  remark  may  be  more  pessimistic  than  is  justified,  since  we  deal  here  with 
small  edges  on  nearly  circular  polygons  with  numerous  sides;  nevertheless,  only 
experiments  exhibiting  the  strengths  and  weaknesses  of  the  scheme  proposed  can 
establish  the  validity  of  more  refined  suggestions. 

In  concluding  this  section  we  note  that  non-convex  regions  turn  out,  some¬ 
what  surprisingly,  to  be  easier  to  handle  than  convex  regions.  Every  concavity  is 
bounded  by  a  single  straight  side  of  the  convex  hull  of  the  body,  which  can  be 
called  the  entrance  to  the  concavity;  if  the  region  is  partially  obscured,  the  concav¬ 
ity  ran  be  said  to  have  a  correctly  visible  entrance  if  no  point  of  obscuration  lies  on 
the  opposite  side  of  the  entrance  from  the  visible  points  of  the  region. 
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Rgure  11:  Concavity  entrances  (The  left  one  is  correctly  visible, 
whereas  the  right  one  is  not.) 


A  correctly  visible  concavity  entrance  identifies  a  straight  side  of  the  convex  hull 
of  the  region  to  which  it  belongs,  and  since  the  hull  has  at  least  this  one  straight 
side  we  can  treat  it  as  ‘partially  polygonal’,  i.e.  can  take  the  polygon  positions  in 
which  one  such  straight  side  is  horizontal  as  the  basis  for  the  search  tree  used  to 
identify  all  regions  having  a  concavity  with  a  correctly  visible  entrance.  Moreover, 
any  concavity  with  a  correctly  visible  entrance  can  be  considered  to  constitute  a 
region  (which  may  be  partially  obscured)  in  its  own  right;  any  technique  applica¬ 
ble  to  (partially  obscured)  regions  can  be  applied  recursively  to  such  a  concavity. 
In  particular,  whenever  the  whole  boundary  of  a  concavity  is  visible,  we  can  find 
its  centroid  and  can  use  this  as  an  anchor  point  for  the  region. 

9.  Three  Dimensional  Bodies 

Having  now  said  a  good  deal  about  the  2-D  case,  we  can  try  to  extend  our 
considerations  to  the  more  challenging  but  more  significant  case  of  bodies  in  three 
dimensions.  Let  us  begin  by  considering  the  probing  technique,  and  first  its  appli¬ 
cation  to  the  simplest  case,  that  of  a  convex  polyhedron  standing  on  one  of  its 
faces.  It  will  generally  be  easy  to  find  two  points  p^,  fixed  in  the  body,  e.g.  we 
can  form  silhouettes  of  the  object  as  viewed  from  the  x,  y,  and  z  directions  and 
use  this  data  to  locate  the  topmost  point  and  the  point  with  largest  z-eoordinate 
(note  that  we  need  the  3-space  locations  of  both  these  points).  This  will  be  deter¬ 
minable  from  the  three  silhouettess  unless  the  topmost  (resp.  rightmost)  edge  or 
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face  of  the  polybedroa  b  parallel  to  the  xy  (resp.  yz)  plane,  in  which  case 
appropriate  technical  adjustments  need  to  be  made.  Once  these  points  have  been 
found  we  can  logically  rotate  and  translate  the  body  to  put  these  two  points  in 
some  standard  position;  then  the  object  b  necessarily  in  one  of  some  finite 
number  of  possible  positions  (the  numtxsr  of  these  positions  being  roughly  propor¬ 
tional  to  the  number  of  faces  of  the  polyhedron)  and  probes  using  a  single-point 
depth  sensor,  organized  in  a  ‘probe  tree*  as  before,  should  identify  the 
polyhedron  without  difficulty. 

A  similar  technique  can  be  used  even  if  the  polyhe^on  does  not  necessarily 
stand  on  a  known  face.  From  three  silhouettes,  we  can  find  the  topmost  and 
bottom-most  points  of  the  polyhedron,  plus  the  points  farthest  left  and  farthest 
right.  These  define  body  position  up  to  one  of  a  finite  number  of  fixed  positions. 
(Similarly,  if  a  horizontal  face  is  topmost  or  bottommost,  this  fact,  plus  the  loca¬ 
tion  of  the  leftmost  and  rightmost  points  of  the  polyhedron,  determines  its  posi¬ 
tion  up  to  finitely  many  possibilities.)  Then  we  can  probe  as  before  to  complete 
identification  and  orientation  of  the  polyhedron. 

Neither  of  these  techniques  depends  on  polyhedron  convexity.  Indeed,  either 
technique  can  be  regarded  as  a  way  of  orienting  the  convex  hull  of  a  non-oonvex 
polyhe^on.  Moreover,  as  soon  as  the  position  of  its  convex  hull  b  limited  to  a 
finite  set,  the  possible  positions  of  the  polyhedron  itself  become  equally  limited, 
and  probing  can  be  used  in  the  ordinary  way  to  complete  its  identification. 

Next  suppose  that  only  part  of  a  polyhedron  is  vbiblc.  ff  thb  part  includes  at 
least  one  comer,  thb  comer  and  an  edge  running  from  it  can  be  used  to  anchor 
the  polyhedron,  following  which  we  can  apply  much  the  same  probing  technique 
as  was  described  for  partly  obscured  polygons. 

To  extend  the  probing  idea  to  3-D  objects  with  smoothly  curved  boundaries, 
we  need  to  find,  not  just  one  anchor  point  (as  in  the  2-D  case),  but  two  anchor 
points  (or  one  anchor  point  and  one  ‘anchor  direction’  emerging  from  it)  which 
have  known  position  relative  to  the  object.  Once  these  points  are  fixed,  the 
object  O  b  only  free  to  turn  about  the  axb  defined  by  these  two  points,  so  that  by 
probing  along  a  circle  perpendicular  to  thb  axb  until  contact  b  made  with  the 
body  surface,  we  can  restrict  o’s  possible  orientations  to  a  finite  set.  After  thb  b 
achieved,  the  probe  tree  method  can  be  used  to  complete  the  identification  of  o. 
Note  also  that,  once  a  single  anchor  point  p  for  has  been  located,  finding  a 
second  anchor  point  q  wUl  generally  reduce  to  a  relatively  easy  2'dimensional 
problem.  If,  for  example,  a  sufficiently  large  portion  of  O’s  surface  b  vbiblc,  we 
can  form  the  intersection  of  0  with  a  sphere  of  appropriate  radius  about  p,  and 
let  q  be  a  point  whose  position  in  the  resulting  curve  C  b  fixed;  for  example,  q 
can  be  the  centroid  of  C.  When  too  little  of  C>  b  vbible  for  thb  approach  to  work, 
another  possibility  b  to  match  C  to  an  appropriate  pre-stored  model  curve  using 
the  second  of  the  fast  matching  techniques  described  in  Section  X. 
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10.  3*D  Object  Recognition  by  Shape  Descriptors 

To  apply  a  shape-descriptor  approach  we  must  consider  generalizations  of  the 
2-D  matching  schemes  presented  above  to  the  3-D  case.  When  the  whole  of  o  is 
visible,  then  an  advantageous  parametiization  becomes  possible.  Ibat  is,  we  can 
take  c  to  be  the  centroid  of  Oy  and  parametrize  the  points  on  its  boundary  by  the 
orientation  of  the  ray  connecting  them  to  c.  This  parametrization  is  relatively 
immune  to  noise.  Details  are  as  follows.  Let  <7  be  a  convex  3-D  object  (if  O  is  an 
observed  object  we  assume  if  to  be  wholly  visible).  The  shape  descriptor  that  we 
want  to  use  for  O  is  simply  a  collection  of  data  points  (n((i)))  on  its  boundary, 
where  each  point  n(6))  is  parametrized  by  the  orientation  w  of  the  ray  connecting 
the  centroid  c  of  c?  with  u;  in  other  words,  this  shape  descriptor  is  a  3-D  vector 
function  defined  on  the  unit  sphere  S  (i.e.  a  generalized  ‘coloring*  of  the  sphere). 

If  (7  is  partially  obscured  its  centroid  cannot  be  determined,  but  instead  we 
can  use  any  other  ‘anchor  point’  having  fixed  location  relative  to  the  visible  part 
of  (7’s  surface  as  the  center  for  an  angle-based  parametrization.  Once  such  a 
fixed  parametrization  of  O’s  surface  is  available,  the  matching  technique  described 
previously  may  become  available;  but  note  the  caverns  concerning  effident^  which 
have  been  expressed.  Note  also  that  the  parametrization  just  outlined  is  also 
applicable  to  the  case  of  nonoonvex  3-D  surfaces,  provided  that  these  surfaces  are 
at  least  star-shaped  with  respect  to  some  ‘andior  point’. 

11.  Polyhedron  Recognition  Using  Silhouettes 

Given  presently  available  sensors,  object  silhouettes  can  be  formed  more 
sharply  and  rapidly  than  depth  images.  For  this  reason,  it  is  worth  considering 
the  extent  in  which  the  silhouettes  of  polyhedra  can  be  used  to  identify  them.  To 
this  end,  the  following  remarks  on  silhouettes  will  be  helpful.  Suppose  that  a  con¬ 
vex  polyhedron  P  is  given  a  certain  orientation  in  3-space,  and  projected  upon  a 
plane  Q  parallel  to  the  xz  plane  which  lies  entirely  on  one  side  of  P.  Given  any 
such  orientation,  one  group  of  P\  faces  will  be  visible  from  0,  while  its  other 
faces  will  be  obscured  by  the  body  of  P.  The  boundary  between  the  visible  and 
the  invisible  portions  is  a  sequence  of  edges  of  /*,  which  we  will  call  the  3- 
silhouette  of  P;  the  projection  onto  the  jz-plane  of  the  3-silhouette  bounds  the  ordi¬ 
nary  2-D  silhouette  of  P,  which  is  always  a  convex  polygon.  If  we  assume  that  no 
face  of  /*  is  orthogonal  to  the  jc-planc,  then  just  one  point  p  of  the  3-silhoucttc 
projects  onto  each  point  g  of  its  2-D  silhouette,  and  p  varies  continuously  with  q. 
Hence  the  3-silhouette  is  topologically  a  circle,  and  therefore  divides  the  surface 
of  p  into  exactly  2  groups  of  faces,  each  of  which  must  be  connected.  The 
silhouette  of  a  convex  polyhedron  P  is  therefore  the  projection,  on  the  camera’s 
image  plane,  of  a  closed  sequence  of  edges  on  P. 

Suppose  we  draw  the  outward-directed  normal  n  to  a  given  face  F  of  P.  Then 
F  is  visible  from  Q  if  n  points  toward  (?,  but  obscured  by  the  body  of  /*  if  n  points 
away  from  Q.  To  understand  how  the  3-silhouette  of  p  varies  as  we  rotate  p 
about  a  vertical  axis,  it  is  convenient  to  project  all  the  normals  n  to  P’s  faces  f 
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onto  the  xy  plane.  This  forms  a  ‘direction  diagram’  consisting  of  unit  vectors  in 
the  plane,  and  then  a  face  is  visible  from  z  if  the  corresponding  projected  nor¬ 
mal  points  toward  one  side,  say  the  negative  side,  of  the  y  axis,  Init  is  invisible 
otherwise.  Thus  the  edge  separating  two  adjacent  faces  and  belongs  to  f’s 
3'Silhouette  if  and  only  if  the  projected  normal  vectors  to  FI  and  F2  point  into 
opposite  sides  of  the  y  axis. 

Take  an  edge  of  the  silhouette  and  its  two  extremities  u^,  M2.  These  are  pro¬ 
jections  of  corresponding  polyhedron  vertices  vj,  which  therefore  lie  along  two 
known  lines  in  space.  Taking  the  eye  of  the  camera  to  be  the  origin  of  coordi¬ 
nates,  we  can  therefore  write  V2  =  where  a^  a2  are  known  unit  vec¬ 

tors.  Let  M2M3  be  the  next  edge  of  the  silhouette.  Ignoring  exceptional  positions, 
this  must  correspond  to  a  polyhedron  edge  V2V3,  and  again  we  have  vj  =  where 
a3  is  a  known  unit  vector.  It  will  be  noted  below  that  the  maximal  number  of  dis¬ 
tinct  3-sUhouettes  that  F  can  have  is  (O(n^)  if  we  consider  only  isometric 

silhouettes),  and  in  fact  the  maximal  number  of  distinct  triplets  v^,  V2,  V3  of  adja¬ 
cent  vertices  of  in  a  silhouette  is  also  at  most  0(n^)  (0(n?)  in  the  isometric  case). 
Hence  (searching  as  always  over  a  finite  number  of  possibilities)  we  can  suppose 
that  the  three  distances  =  |v^v2|,  D2  =  ^2^3!,  O3  =  are  known.  This  gives  us 
three  quadratic  equations  for  determining  the  three  unknowns  x,  y,  z,  namely 

J.2  +  3,2  _  2iy(v^-V2)  “  Dl 
y^  +  lyziv^v^  =  £>2 
z2  -I-  x2  -  2rr(v3  Vi)  =  Oj 

These  can  readily  be  solved  by  subtracting  suitable  multiples  of  the  third  equation 
from  the  first  two,  which  gives  two  inhomogeneous  quadratic  equations  for  the 
ratios  t  =  x/z  and  r\  =  y/z  .  Thus  knowing  the  positions  of  these  successive  vertices 
on  the  perimeter  of  the  silhouette  determines  the  polyhedron  orientation  up  to  a 
finite  number  of  possibilities,  and  hence  determines  the  entire  silhouette  in  the 
same  sense.  If  the  silhouette  has  four  or  more  vertices  we  should  therefore  be 
able  to  compare  a  finite  collection  of  calculated  silhouettes  with  an  actual 
silhouette,  and  this  will  often  identify  the  body  and  its  orientation  uniquely. 
Identification  becomes  even  easier  if  we  assume  that  silhouettes  viewed  from  two 
slightly  different  angles  are  available;  we  leave  it  to  the  reader  to  work  out  the 
details  involved. 

As  usual,  the  situation  is  somewhat  more  favorable  if  the  polyhedron  and  its 
silhouette  are  nonconvex.  In  such  case  each  vertex  of  the  silhouette  which  has  the 
property  that  the  silhouette  lies  on  the  smaller  side  of  the  two  silhouette  edges 
forming  the  silhouette  boundary  (‘convex  comers’)  must  correspond  to  a  comer  of 
the  polyhedron.  Moreover,  any  segment  connecting  two  such  points  which  does 
not  form  part  of  the  silhouette  boundary  must  be  the  image  of  an  edge  (‘flying 
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edge’)  which  connects  two  comers  of  the  polyhedron  but  is  not  an  edge  of  the 
polyhedron.  Often  this  observation  will  make  It  possible  to  identify  silhouette  ver¬ 
tices  rapidly.  Consider,  for  example,  the  union  of  parallelipip^  and  pyramid 
shown  in  the  following  figure: 


u 


Figure  12:  A  non-convex  polyhedral  object 
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Figure  13:  Silhouette  of  a  non-convex  polyhedral  object 


In  the  silhouette  the  point  «  must  correpond  to  v  since  v  is  the  only  polyhedron 
vertex  connected  to  more  than  one  other  vertex  by  a  flying  edge. 

To  show  that  for  any  convex  polyhedron  P  there  exist  at  most  0{n^)  distinct 
silhouettes  and,  in  the  case  of  isometric  projections,  only  0(n^)  silhouettes,  we  can 
argue  as  follows.  Consider  the  isometric  case  first.  An  isometric  silhouette  is 
uniquely  determined  by  the  direction  in  which  P  is  projected,  and  hence  in  turn 
by  a  point  on  the  unit  sphere.  As  already  noted,  a  silhouette  changes  its  com¬ 
binatorial  structure  only  when  it  is  projected  in  a  direction  ?  at  which  some  face  F 
of  P  is  seen  end-on;  in  other  words,  only  when  ▼  is  perpendicular  to  the  normal 
of  F,  But  for  each  face  F,  the  locus  of  orientations  v  which  satisfy  this  condition 
is  a  great  circle  on  the  surface  of  the  unit  sphere.  Since  P  has  n  faces,  this  defines 
n  su^  great  drcles  which  collectively  partition  the  sphere  into  0(n^)  open  regions, 
inside  each  of  which  the  combinatorial  structure  of  P’s  silhouette  remains  con¬ 
stant. 

Next  consider  silhouettes  seen  from  an  arbitrary  viewpoint  z.  In  this  case 
the  combinatorial  structure  of  a  silhouette  can  change  only  when  z  lies  on  one  of 
the  face  planes  of  P.  These  n  planes  decompose  the  3-D  space  exterior  to  p  into 
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0(n^)  regions,  within  each  of  which  the  combinatorial  structure  of  the  silhouette 
remains  constant,  as  asserted. 
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ABSTRACT.  A  theory  is  pre.sented  of  how  recognition  categories  can  be  learned 
in  response  to  a  temporal  stream  of  input  patterns.  Interactions  between  an  attentional 
subsystem  and  an  orienting  subsystem  enable  the  network  to  .self-stabilize  its  learning, 
without  an  external  teacher,  as  the  code  becomes  globally  self-consistent.  Category  learn¬ 
ing  is  thus  determined  by  global  contextual  information  in  this  system.  The  attentional 
subsystem  learns  bottom-up  codes  and  top-down  templates,  or  expectancies.  The  internal 
representations  formed  in  this  way  stabilize  themselves  against  recoding  by  matching  the 
learned  top-down  templates  against  input  patterns.  This  matching  process  detects  struc¬ 
tural  pattern  properties  in  addition  to  local  feature  matches.  The  top-down  templates 
can  also  suppress  noise  in  the  input  patterns,  and  can  subliminally  prime  the  network 
to  anticipate  a  set  cjf  input  patterns.  Mismatches  activate  an  orienting  subsystem,  which 
resets  incorrect  codes  and  drives  a  rapid  search  for  new  or  more  appropriate  codes.  As 
the  learned  code  becomes  globally  self-consistent,  the  orienting  subsystem  is  automatically 
disengaged  and  the  memory  consolidates.  After  the  recognition  categories  for  a  set  of  input 
patterns  self-stabilize,  those  patterns  directly  access  their  categories  without  any  search 
or  recoding  on  future  recognition  trials.  A  novel  pattern  exemplar  can  directly  access  an 
established  category  if  it  shares  invariant  properties  with  the  set  of  familiar  exemplars  of 
that  category.  Several  attentional  and  nonspecific  arousal  mechanisms  modulate  the  course 
of  search  and  learning.  Three  types  of  attentional  mechanism — priming,  gain  control,  and 
vigilance — are  distinguished.  Three  types  of  nonspecific  arousal  are  also  mechanistically 
characterized.  The  nonspecific  vigilance  process  determines  how  fine  the  learned  categories 
will  b<-.  If  vigilance  increases  due.  for  I'xample.  to  a  negative  reinforcement,  then  the  system 
automatically  .searches  for  and  learn.'  finer  recognition  categories.  The  learned  top-down 
ex])eri  ancies  become  more  aVjstract  a.'  the  recognition  categories  become  Vjroader.  The 
learned  code  is  a  property  of  network  interactions  and  the  entire  history  of  input  pattern 
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presentations.  The  interactions  generate  emergent  rules  such  as  a  Weber  Law  Rule,  a  2/3 
Rule,  and  an  Associative  Decay  Rule.  No  serial  programs  or  algorithmic  rule  structures 
are  used. 

1.  Introduction:  Self- Organization  of  Recognition  Categories.  A  fundamental 
problem  of  perception  and  learning  concerns  the  characterization  of  how  recognition  cat¬ 
egories  emerge  as  a  function  of  experience.  When  such  categories  spontaneously  emerge 
through  an  individual's  interaction  with  an  environment,  the  processes  are  said  to  undergo 
self-organization  [l].  A  theory  of  how  recognition  categories  can  self-organize  is  outlined 
in  this  report,  which  summarizes  the  model’s  design  and  mathematical  analysis,  developed 
in  other  articles  [2-4].  In  those  articles,  the  adaptive,  resonance  theory  is  also  related  to 
recent  data  about  evoked  potentials  and  about  amnesias  due  to  malfunction  of  medial 
temporal  brain  structures.  Results  of  evoked  potential  and  clinical  studies  suggest  which 
macroscopic  brain  structures  could  carry  out  the  theoretical  dynamics.  The  theory  also 
specifies  microscopic  neural  dynamics,  with  local  processes  obeying  membrane  equations 
(Appendix). 

We  focus  herein  upon  principles  and  mechanisms  that  are  capable  of  self-organizing 
stable  recognition  codes  in  response  to  arbitrary  temporal  sequences  of  input  patterns. 
These  principles  and  mechanisms  lead  to  the  design  of  a  neural  network  whose  parameters 
can  be  specialized  for  applications  to  particular  problem  domains,  such  as  speech  and 
vision.  In  these  domains,  preprocessing  stages  prepare  environmental  inputs  for  the  self¬ 
organizing  category  formation  and  recognition  system.  Work  on  speech  and  language 
preprocessing  has  characterized  those  stages  after  which  such  a  self-organizing  recognition 
system  can  build  up  codes  for  phonemes,  syllables,  and  words  [5-7].  Work  on  form  and  color 
preprocessing  has  characterized  those  stages  after  wdiich  such  a  self-organizing  recognition 
system  can  build  up  codes  for  visual  object  recognition  [8,9]. 

2.  Bottom-Up  Adaptive  Filtering  and  Contrast-Enhancement  in  Short  Term 
Memory.  We  now'  introduce  in  a  qualitative  way  the  main  mechanisms  of  the  theory.  We 
do  so  by  considering  the  typical  network  reactions  to  a  single  input  pattern  I  within  a 
temporal  stream  of  input  patterns.  Each  input  pattern  may  be  the  output  pattern  of 
a  preprocessing  stage.  The  input  pattern  I  is  received  at  the  stage  Fj  of  an  attentional 
subsystem.  Pattern  I  is  transformed  into  a  pattern  X  of  activation  across  the  nodes  of 
F]  (Figure  1).  The  transformed  pattern  X  represents  a  pattern  in  short  term  memory 
(STM).  In  Fi  each  node,  w^hose  activity  is  sufficiently  large  generates  excitatory  signals 
along  pathways  to  target  nodes  at  the  next  processing  stage  F2.  A  pattern  X  of  STM 
activities  across  Fj  hereby  elicits  a  pattern  S  of  output  signals  from  Fj.  When  a  signal 
from  a  node  in  Fj  is  carried  along  a  pathway  to  F2,  the  signal  is  multiplied,  or  gated,  by 
the  pathway’s  long  term  memory  (LTXI)  trace.  The  LTM  gated  signal  (i.e..  signal  times 
LTM  trace),  not  the  signal  alone,  reaches  the  target  node.  Each  target  node  sums  up  all 
of  its  LTM  gated  signals.  In  this  way.  pattern  S  generates  a  pattern  T  of  LTM-gated  and 
summed  input  signals  to  F2  (Figure  2a).  The  transformation  from  S  to  T  is  called  an 
adaptive  filter. 

The  input  pattern  T  to  F2  is  quickly  traiisforiiK'd  by  interactions  among  the  nodes 
of  F2.  These  interactions  contrast-enhance  the  input  pattern  T.  The  resulting  pattern  of 
activation  across  F2  is  a  new  pattern  Y.  The  contrast-enhanced  pattern  Y,  rather  than  the 
input  pattern  T,  is  stored  in  STM  by  F2. 

A  special  case  of  this  contrast-enhancement  process,  in  which  F2  chooses  the  node 
which  receives  the  largest  input,  is  here  considered.  The  chosen  node  is  the  only  one  that 
can  store  activity  in  STM.  In  more  general  versions  of  the  theory,  the  contrast  enhancing 
transformation  from  T  to  Y  enables  more  than  one  node  at  a  time  to  be  active  in  STM. 
Such  transformations  are  designed  to  simultaneously  represent  in  STM  many  subsets,  or 
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STM  ACTIVITY 
PATTERN  (Y) 


INPUT  PATTERN  (I) 

Figure  1.  Stages  of  bottom-up  activation:  The  injmt  pattern  I  generates  a  pattern  of 
STM  activation  X  across  Fj.  Sufficiently  active  Fj  nodes  emit  bottom-up  signals  to  F2. 
This  signal  pattern  S  is  gated  by  long  term  memory  (LTM)  traces  within  the  Fi  —>■  F2 
pathways.  The  LTM-gated  signals  are  summed  before  activating  their  target  nodes  in  F2. 
This  LTM-gated  and  summed  signal  pattern  T  generates  a  pattern  of  activation  Y  across 

F2- 

gronpings.  of  an  input  pattern  [6.10).  When  Fj  is  de.signed  to  make  a  choice  in  STM.  it 
selects  that  global  grouping  of  the  inj>ut  pattern  which  is  preferred  by  the  adaptive  filter. 
This  process  automatically  enables  the  network  to  partition  all  the  input  patterns  which 
are  received  by  F]  into  disjoint  sets  of  recognition  categories,  each  corresponding  to  a 
particular  node  in  F2. 

Only  those  nodes  of  F2  which  maintain  stored  activity  in  STM  can  elicit  new  learning 
at  contiguous  LTM  traces.  Whereas  all  the  LTM  traces  in  the  adaptive  filter,  and  thus 
all  learned  past  experiences  of  the  network,  are  used  to  determine  recognition  via  the 
transformation  1— >X— >S— »T— Y,  only  those  LTM  traces  whose  STM  activities  in  F2  survive 
the  contrast-enhancement  process  can  learn  in  response  to  the  activity  pattern  X. 
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Figure  2.  Search  for  a  correct  F2  code:  (a)  The  input  pattern  I  generates  the  specific 
STM  activity  pattern  X  at  F |  as  it  nonspecifically  activates  A.  Pattern  X  both  inhibits  .4 
and  generates  the  out]>ut  signal  pattern  S.  Signal  pattern  S  is  transformed  into  the  ini)iit 
pattern  T.  which  activates  the  STM  pattern  Y  across  Fj.  (b)  Pattern  Y  generates  the  to])- 
down  signal  pattern  U  which  is  transformed  into  the  template  pattern  V.  If  V  mismatches 
I  at  F],  then  a  new  STM  activity  pattern  X*  is  generated  at  Fj.  The  reduction  in  total 
STM  activity  which  occurs  when  X  is  transformed  into  X*  causes  a  decrease  in  the  total 
inhibition  from  Fi  to  >4.  (c)  Then  the  input-driven  activation  of  A  can  release  a  nonspecific 
arousal  wave  to  F2,  which  resets  the  STM  pattern  Y  at  F2.  (d)  After  Y  is  inhibited,  its 
top-down  template  is  eliminated,  and  X  can  be  reinstated  at  F].  Now  X  once  again 
generates  input  pattern  T  to  F2,  but  since  Y  remains  inhibited  T  can  activate  a  different 
STM  pattern  Y*  at  F2.  If  the  top-down  template  due  to  Y*  also  mismatches  I  at  F],  then 
the  rapid  search  for  an  appropriate  F2  code  continues. 
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The  bottorn-up  STM  transformation  I— T— Y  is  not  the  only  process  that 
regulates  network  learning.  In  the  absence  of  top-down  processing,  the  LTM  traces  within 
the  adaptive  filter  S— T  (Figure  2a)  can  respond  to  certain  sequences  of  input  patterns 
by  being  ceaselessly  recoded  in  such  a  way  that  individual  events  are  never  eventually 
encoded  by  a  single  category  no  matter  how  many  times  they  are  presented.  An  infinite 
class  of  examples  in  which  temporally  unstable  codes  evolve  is  described  in  Section  7.  It 
was  the  instability  of  bottoin-up  adaptive  coding  that  led  Grossberg  [11.12]  to  introduce 
the  adaptive  resonance  theory. 

In  the  adaptive  resonance  theory,  a  matching  process  at  Fj  cjxists  whereby  learned  top- 
down  expectancies,  or  templates,  from  F2  to  Fj  are  compared  with  the  bottom-up  input 
pattern  to  Fi .  This  matching  process  stabilizes  the  learning  that  emerges  in  response  to 
an  arbitrary  input  environment.  The  constraints  that  follow  from  the  need  to  stabilize 
learning  enable  us  to  choose  among  the  many  possible  versions  of  top-down  template 
matching  and  STM  processes.  These  learning  constraints  upon  the  adaptive  resonance  top- 
down  design  have  enabled  the  theory  to  explain  data  from  visual  and  auditory  information 
processing  experiments  in  which  learning  has  not  been  a  manipulated  variable  [4.6,7]. 
These  mechanisms  have  now  been  developed  into  a  rigorously  characterized  learning  system 
whose  properties  have  been  quantitatively  analysed  [2,3].  This  analysis  has  revealed  new 
design  constraints  within  the  adaptive  resonance  theory.  The  system  that  we  wfill  describe 
for  learned  categorical  recognition  is  one  outcome  of  this  analysis. 

Figure  3  summarizes  the  total  network  architecture.  It  includes  modulatory  processes, 
such  as  attentional  gain  control,  which  regulate  matching  within  Fj,  as  well  as  modulatory 
processes,  such  as  orienting  arousal,  which  regulate  reset  within  F2.  Figure  3  also  includes 
an  attentional  gain  control  process  at  F2.  Such  a  process  enables  offset  of  the  input  pattern 
to  terminate  all  STM  activity  within  the  attentional  subsystem  in  preparation  for  the  next 
input  pattern.  In  this  example.  STM  storage  can  persist  after  the  input  pattern  terminates 
only  if  an  internally  generated  or  intermodality  input  source  maintains  the  activity  of  the 
attentional  gain  control  system. 


3.  Top-Down  Template  Matching  and  Stabilization  of  Code  Learning.  We 

now  begin  to  consider  how  top-down  template  matching  can  stabilize  code  learning.  In 
order  to  do  so,  top-down  template  matching  at  Fj  must  be  able  to  prevent  learning  at 
bottom-up  LTM  traces  whose  contiguous  F2  nodes  are  only  momentarily  activated  in 
STM.  This  ability  depends  upon  the  different  rates  at  which  STM  activities  and  LTM 
traces  can  change.  The  STM  transformation  I — S^T— ^Y  takes  place  very  quickly. 
By  ‘'very  quickly"  we  mean  much  more  quickly  than  the  rate  at  which  the  LTM  traces  in 
the  adaptive  filter  S— T  can  change.  As  soon  as  the  bottom-up  STM  transformation  X-^Y 
takes  place,  the  .STM  activities  Y  in  F2  elicit  a  top-down  excitatory  signal  pattern  U  back 
to  F].  Only  sufficiently  large  STM  activities  in  Y  elicit  signals  in  U  along  the  feedback 
pathways  F2  — 'Fj . 

.4s  in  the  boTtom-u])  adaptive  filter,  the  toji-down  signals  U  are  also  gated  by  LTM 
trace?!  before  the  LTM-gaied  sigiial.?-  are  >iiniu]ed  at  Fj  nodes.  The  pattern  L'  of  output 
signals  from  Fj  hereby  generates  a  pattern  V  of  LTM-gated  and  summed  input  signals 
to  F].  The  transformation  from  V  to  V  is  thus  also  an  adaptive  filter.  The  pattern  Y'  is 
called  a  top-down  template,  or  learned  expectation  (Figure  2b). 

Two  sources  of  input  now  perturb  Fj:  the  bottom-up  input  pattern  I  which  gave  rise 
to  the  original  activity  pattern  X,  and  the  top-down  template  pattern  V  that  resulted  from 
activating  X.  The  activity  pattern  X’  across  F]  that  is  induced  by  I  and  4  tal^en  together 
is  typically  different  from  the  activity  pattern  X  that  wa.^  previously  induced  by  I  alone. 
In  particular.  F]  acts  to  match  V  against  1.  The  result  of  this  matching  process  determines 
the  future  course  of  learning  and  recognition  by  the  network. 
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Figure  3.  Ajuituiiiy  of  the  atlentioiial-orieiiting  system;  Tliis  figure  describes  al)  the 
interactions  of  the  model  without  regard  to  which  comjjoneiits  are  active  at  any  given 
time. 

The  entire  activation  sequence 

j  ^  X S  ^  V  ^  X*  (1) 

takes  place  very  quickly  relative  to  the  rate  w'ith  whicli  the  LTM  traces  in  either  the 
bottom-up  adaptive  filter  S — >T  cjr  the  top-down  ada])1ive  filter  U  can  change.  Even 
though  none  of  the  LTM  traces  changes  during  such  a  short  time,  their  prior  learning 
strongly  influences  the  STM  i)atterns  Y  and  X*  that  evolve  within  the  network.  We  now' 
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discuss  liow  a  match  or  mismatch  of  I  and  V  at  Fj  regulates  the  course  of  learning  in 
response  to  the  pattern  1. 

4.  Interactions  between  Attentional  and  Orienting  Subsystems;  STM  Reset 
and  Search.  This  section  outlines  how  a  mismatch  at  F]  regulates  the  learning  process. 
With  this  general  scheme  in  mind,  we  will  be  able  to  consider  details  of  how  bottom-up 
filters  and  top-down  templates  are  learned  and  how  matching  takes  place. 

Level  F)  can  compute  a  match  or  mismatch  between  a  bottom-up  input  pattern  I 
and  a  top-down  template  pattern  V,  but  it  cannot  compute  which  STM  pattern  Y  across 
F2  generated  the  template  pattern  V.  Thus  the  outcome  of  matching  at  F]  must  have  a 
nonspecific  effect  upon  F2  that  can  potentially  influence  all  of  the  F2  nodes,  any  one  of 
which  may  have  read-out  V.  The  internal  organization  of  F2  must  be  the  agent  whereby 
this  nonspecific  event,  which  we  call  a  reset  wave,  selectively  alters  the  stored  STM  activity 
pattern  Y.  The  reset  wave  is  one  of  the  three  types  of  nonspecific  arousal  that  exist  within 
the  netwmrk.  In  particular,  we  suggest  that  a  mismatch  of  I  and  V  within  F]  generates  a 
nonspecific  arousal  burst  that  inhibits  the  active  population  in  F2  which  read-out  V.  In 
this  way.  an  erroneous  STM  representation  at  F2  is  quickly  eliminated  before  any  LTM 
traces  can  encode  this  error. 

The  attentional  subsystem  works  together  with  an  orienting  .subsystem  to  carry  out 
these  interactions.  All  learning  takes  place  within  the  attentional  subsystem.  All  matches 
and  mismatches  are  computed  within  the  attentional  si:bsystem.  The  orienting  subsystem 
is  the  source  of  the  nonspecific  arousal  bursts  that  reset  STM  within  level  F2  of  the 
attentional  subsystem.  The  outcome  of  matching  witliin  Fj  determines  whether  or  not 
such  an  arousal  burst  will  be  generated  by  the  orienting  subsystem.  Thus  the  orienting 
system  mediates  reset  of  F2  due  to  mismatches  within  F  | . 

Figure  2  depicts  a  typical  interaction  between  the  attentional  subsystem  and  the  ori¬ 
enting  subsystem. In  Figure  2a.  an  input  pattern  I  instates  an  STM  activity  pattern  X 
across  F].  The  input  pattern  I  also  excites  the  orienting  population  A,  but  pattern  X  at 
F]  inhibits  A  before  it  can  generate  an  output  signal. 

Activity  pattern  X  also  generates  an  output  pattern  S  which,  via  the  bottom-up  adap¬ 
tive  filter,  instates  an  STM  activity  pattern  Y  across  F2.  In  Figure  2b,  pattern  Y  reads 
a  top-down  template  pattern  V  into  Fj.  Template  V  mismatches  input  I,  thereby  signifi¬ 
cantly  inhibiting  STM  activity  across  F,.  The  amount  by  which  activity  in  X  is  attenuated 
to  generate  X*  depends  upon  how  much  of  the  input  pattern  I  is  encoded  within  the  tem¬ 
plate  pattern  V. 

When  a  mismatch  attenuates  STM  activity  across  F  j ,  this  activity  no  longer  prevents 
the  arousal  source  A  from  firing.  Figure  2c  depicts  how  disinhibition  of  A  releases  a 
nonspecific  arousal  burst  to  F2.  This  arousal  burst,  in  turn,  selectively  inhibits  the  active 
population  in  F2.  This  inhibition  is  long-lasting.  One  physiological  design  for  F2  processing 
which  has  these  necessary  properties  is  a  dipole  field  [4.13].  A  dipole  field  consists  of 
opponent  processing  channels  wdiich  are  gated  by  habituating  chemical  transmitters.  A 
nonsqjecific  arousal  burst  induces  selectiv*  and  enduring  inhibition  within  a  dipole  field. 
In  Figure  2c.  inhibition  of  Y  leads  to  inhibition  of  the  top-down  template  \  .  and  thereby 
terminates  the  mismatch  between  I  and  \  .  Input  pattern  I  can  thus  reinstate  the  activity 
pattern  X  across  Fi,  which  again  generates  the  output  pattern  S  from  Fi  and  the  input 
pattern  T  to  F2.  Due  to  the  enduring  inhibition  at  F2-  the  input  pattern  T  can  no  longer 
activate  the  same  pattern  Y  at  F2.  A  new  pattern  A*  is  thus  generated  at  F2  by  I  (Figure 
2d).  Despite  the  fact  that  some  F2  nodes  may  remain  inhibited  by  the  STM  reset  property, 
the  new^  pattern  Y*  may  encode  large  STM  activities.  This  is  because  level  F2  is  designed 
so  that  its  total  suprat'hreshold  activity  remains  approximately  constant,  or  normalized, 
despite  the  fact  that  some  of  its  nodes  may  remain  inhibited  by  the  STM  reset  mechanism. 
This  property  is  related  to  the  limited  capacity  of  STM.  A  physiological  process  capable 
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of  achieving  the  STM  normalization  property  can  be  based  upon  on-center  off-surround 
interactions  among  cells  obeying  membrane  equations  [4.14]. 

The  new  activity  pattern  Y'  rcads-out  a  new  top-down  template  pattern  V‘.  If  a 
mismatch  again  occurs  at  Fi.  the  orienting  subsystem  is  again  engaged,  thereby  leading 
to  another  arousal-mediated  reset  of  STM  at  fV  In  this  way,  a  rapid  series' of  STM 
matching  and  reset  events  may  occur.  Such  an  STM  matching  and  reset  series  controls  the 
system  s  search  of  LTM  by  sequentially  engaging  the  novelty-sensitive  orienting  subsystem. 
Although  STM  is  reset  sequentially  in  time,  the  mechanisms  which  control  the  LTM  search 
are  all  parallel  network  interactions,  rather  than  serial  algorithms.  Such  a  parallel  search 
scheme  is  necessary  in  a  system  whose  LTM  codes  do  not  exist  a  priori.  In  general,  the 
spatial  configuration  of  codes  in  such  a  system  depends  upon  both  the  system's  initial 
configuration  and  its  unique  learning  history.  Consequently,  no  prewired  serial  algorithm 
could  possibly  anticipate  an  efficient  order  of  search. 

The  mismatch-mediated  search  of  LTAI  ends  when  an  STM  pattern  across  F2  reads-out 
a  top-down  template  whicii  either  matches  I.  to  the  degree  of  accuracy  required  by  the  level 
of  attentional  vigilance,  or  has  not  yet  undergone  any  prior  learning.  In  the  latter  case,  a 
nev  recognition  category  is  established  as  a  bottom-up  code  and  top-down  template  are 
learned. 

Y  e  now  begin  to  consider  details  of  the  bottom-up/top-down  matching  process  across 
F],  The  nature  of  this  matching  process  is  clarified  by  a  consideration  of  how  Fj  distin¬ 
guishes  betw'een  activation  by  bottom-up  inputs  and  top-down  templates. 

5,  Attentional  Gain  Control  and  Attentional  Priming.  The  importance  of 
the  distinction  between  bottom-up  and  top-down  processing  becomes  evident  when'  one 
observes  that  the  same  top-down  template  matching  process  which  stabilizes  learning  is 
also  a  mechanism  of  attentional  priming.  Consider,  for  example,  a  situation  in  which  F2 
is  activated  by  a  level  other  than  F]  before  Fj  is  itself  activated.  In  such  a  situation,  F2 
can  generate  a  top-down  template  to  Fj.  The  level  Fj  is  then  primed,  or  ready,  to 
receive  a  bottorn-up  input  that  may  or  may  not  match  the  active  expectancy.  Level  Fj 
can  be  primed  To  receive  a  bottorn-up  input  without  necessarily  eliciting  suprathreshold 
output  signals  in  response  to  the  priming  expectancy.  If  this  were  not  possible,  then  every 
prirning  event  would  lead  to  suprathreshold  consequences.  Such  a  property  would  prevent 
subliminal  anticipation  of  a  future  event. 

On  the  other  hand,  an  input  pattern  I  must  be  able  to  generate  a  suprathreshold 
activity  pattern  X  even  if  no  top-down  expectancy  is  active  across  Fj  (Figure  2).  How  does 
F ]  know  that  it  should  generate  a  suprathreshold  reaction  to  a  bottom-up  input  pattern 
but  not  to  a  top-down  input  pattern?  In  both  cases,  an  input  pattern  stimulates  Fj  cells. 
Some  auxiliary  mechanism  must  exist  to  distinguish  between  bottom-up  and  top-down 
inputs.  We  call  this  auxiliary  mechanism  attentional  gain  control  to  distinguish  it  from 
attentioiial  priming  by  the  top-down  Template  itself.  The  attentional  priming  mechanism 
delivers  specific  template  patterns  to  Fj.  The  attentional  gain  control  mechanism  has  a 
nonspecific  effect  on  the  sensitivity  with  wliich  Fj  responds  to  the  template  pattern,  as 
well  as  To  other  potterns  received  by  Fj.  Attentional  gain  control  is  one  of  the  three  types 
of  nonspecific  arousal  that  exist  within  The  network.  With  the  addition  of  attentional  gain 
control,  we  can  explain  qualitatively  how  Fj  can  tell  the  difference  between  bottom-up  and 
top-down  signal  patterns. 

The  need  to  dissociate  qttentional  priming  from  attentional  gain  control  can  also  be 
seen  from  the  fact  that  top-down  priming  events  do  not  lead  necessarily  to  subliminal  reac¬ 
tions  at  Fj.  L'nder  certain  circumstances,  top-down  expectancies  can  lead  to  suprathresh¬ 
old  consequence.s.  Ye  can.  for  example,  experience  internal  conversations  or  images  at  will. 
Thus  there  exists  a  difference  between  the  read-out  of  a  top-down  template,  which  is  a 
mechanism  of  attentional  priming,  and  the  translation  of  this  operation  into  suprathreshold 
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signals  due  to  atteritional  gain  control.  An  “act  of  will’’  ran  amplify  atteiitional  gain  con¬ 
trol  signals  to  elicit  a  suprathreshold  reaction  at  F]  in  response  to  an  attentional  priming 
pattern  from  F2. 

Figure  4  depicts  one  possible  scheme  whereby  supraliminal  reactions  to  l>ottom-up 
signals,  subliminal  reactions  to  top-down  signals,  and  supraliminal  reactions  to  matched 
bottom-up  and  top-down  signals  can  be  achieved.  Figure  4d  shows,  in  addition,  how 
competitive  interactions  across  modalities  can  prevent  Fi  Irom  generating  a  supraliminal 
reaction  to  bottorn-up  signals,  as  when  attention  shifts  from  one  modality  to  another. 

6.  Matching:  The  2/3  Rule.  We  can  now  outline  the  matching  and  coding  proper¬ 
ties  that  are  used  to  generate  learning  of  self-stabilizing  recognition  categories.  Two  dif¬ 
ferent  types  of  properties  need  to  be  articulated:  the  bottom-up  coding  properties  which 
determine  the  order  of  search,  and  the  top-down  matching  properties  which  determine 
whether  an  STM  reset  event  will  be  elicited.  Order  of  search  is  determined  entirely  by 
properties  of  the  attentional  subsystem.  The  choice  between  STM  reset  and  STM  res¬ 
onance  is  dependent  upon  whether  or  not  the  orienting  subsystem  will  generate  a  reset 
wave.  This  computation  is  based  on  inputs  received  by  the  orienting  subsystem  from  both 
the  bottom-up  input  pattern  I  and  the  STM  pattern  which  F]  computes  within  the  at¬ 
tentional  subsystem  (Figure  2).  Both  the  order  of  search  and  the  choice  between  reset 
and  resonance  are  sensitive  to  the  matched  patterns  a  whole.  This  global  sensitivity  is 
key  to  the  design  of  a  single  system  capable  of  matching  patterns  in  which  the  number  of 
coded  features,  or  details,  may  vary  greatly.  Such  global  context-sensitivity  is  needed  to 
determine  whether  a  fixed  amount  of  mismatch  should  be  treated  as  functional  noise,  or  as 
an  event  capable  of  eliciting  search  for  a  different  category.  For  example,  one  or  two  details 
may  be  sufficient  to  differentiate  two  small  but  functionally  distinct  patterns,  whereas  the 
same  details,  embedded  in  a  large,  complex  pattern  may  be  quite  irrelevant. 

We  first  discuss  the  properties  which  determine  the  order  of  search.  Network  interac¬ 
tions  which  control  search  order  can  be  described  in  terms  of  three  rules;  the  2/3  Rule, 
the  Weber  Law  Buie,  and  the  Associative  Decay  Rule. 

The  2/3  H.ule  follows  naturally  from  the  distinction  between  attentional  gain  control 
and  attentional  priming.  It  says  that  two  out  of  three  signal  sources  must  activate  an  Fj 
node  in  order  for  that  node  to  generate  suprathreshold  output  signals.  In  Figure  4a,  for 
example,  during  bottom-up  processing,  a  suprathreshold  node  in  Fj  is  one  which  receives 
a  specific  input  from  the  input  pattern  I  and  a  non^})ecific  attentional  gain  control  signal. 
All  other  nodes  in  Fi  receive  only  the  nonspecific  gain  control  signal.  Since  these  cells 
receive  inputs  from  only  one  pathway  they  do  not  fire. 

In  Figure  4b,  during  top-down  processing,  or  priming,  some  nodes  in  Fi  receive  a 
template  signal  from  F2,  whereas  other  nodes  receive  no  signal  whatsoever.  All  the  nodes 
of  F]  receive  inputs  from  at  most  one  of  their  three  possible  input  sources.  Hence  no  cells 
in  F^  are  supraliminally  activated  by  a  top-down  template. 

During  simultaneous  bottom-uji  and  top-down  signalling,  the  attentional  gain  control 
signal  is  inhibited  by  the  top-down  channel  (Figure  4c  Despite  this  fact,  some  nodes  of 
F]  may  receive  sufficiently  large  irqiuts  from  both  the  bottom-up  and  the  top-down  signal 
patterns  to  generate  suprathreshold  outputs.  Other  nodes  may  receive  inputs  from  the 
top-down  template  pattern  or  the  bottom-up  input  pattern,  but  not  both.  These  nodes 
receive  signals  from  only  one  of  their  possible  sources,  hence  do  not  fire.  Cells  which  receive 
no  inputs  do  not  fire  either.  Thus  only  cells  that  are  conjointly  activated  by  the  bottom- 
up  input  and  the  top-down  template  can  fire  when  a  top-down  template  is  active.  The 
2/3  Rule  clarifies  the  apparently  paradoxical  process  whereby  the  addition  of  top-down 
excitatory  inputs  to  F]  can  lead  to  an  overall  decrease  in  Fi  's  STM  activity  (Figures  2a 
and  2b). 
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Figure*  4.  Matcliiii};  by  2/3  Fulo:  (a)  In  this  ('xaiJi])lc'.  iioi)sj)C’cific  iittcnitional  gain  control 
signals  arc  phasically  activated  by  the  bottom-up  in])ut .  In  this  network,  the  bottom-up 
input  arouses  two  different  nonspecific  channels:  the  attentional  gain  control  channel  and 
the  orienting  subsystem.  Only  F  ]  cells  that  receive  bottom-up  inputs  and  gain  control 
signals  can  become  supraliminally  active,  (b)  A  toi)-down  template  from  F2  inhibits  the 
attentional  gain  control  source  as  it  subliminally  primes  target  F]  cells,  (c)  When  a 
bottom-up  input  pattern  and  a  top-down  template  are  simultaneously  active,  only  those 
F]  cells  that  receive  inputs  from  both  sources  can  become  supraliminally  active,  since  the 
gain  control  source  is  inhibited,  (d)  Intermodality  inhibition  can  shut  off  the  gain  control 
source  and  thereby  prevent  a  bottom-up  input  from  supraliminally  activating  F]. 
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7.  Example  of  Code  Instability.  We  now  illustrate  the  importance  of  the  2/3  Rule 
by  describing  how  its  absence  can  lead  to  a  temporally  unstable  code.  In  the  simplest 
type  of  code  instability  example,  the  code  becomes  unstable  because  neither  top-down 
template  nor  reset  mechanisms  exist  [llj.  Then,  in  response  to  certain  input  sequences 
that  are  repeated  through  time,  a  given  input  pattern  can  be  ceaselessly  recoded  into 
more  than  one  category.  In  the  example  that  we  will  now  describe,  the  top-down  template 
signals  are  active  and  the  reset  mechanism  is  functional.  However,  the  inhibitory  top-dowm 
attentional  gain  control  signals  (Figures  3  and  4c)  are  chosen  too  small  for  the  2/3  Rule  to 
hold  at  Fi .  We  show  also  that  a  larger  choice  of  attentional  gain  control  signals  restores 
code  stability  by  reinstating  the  2/3  Rule.  These  simulations  also  illustrate  three  other 
points:  how  a  novel  exemplar  can  directly  access  a  previously  established  category;  how 
the  category  in  which  a  given  exemplar  is  coded  can  be  influenced  by  the  categories  which 
form  to  encode  very  different  exemplars:  and  how  the  network  responds  to  exemplars  as 
coherent  groupings  of  features,  rather  than  to  isolated  feature  matches  or  mismatches. 

Figure  5a  summarizes  a  computer  simulation  of  unstable  code  learning.  Figure  5b 
summarizes  a  computer  simulation  that  illustrates  how  reinstatement  of  the  2/3  Rule  can 
stabilize  code  learning. 

The  first  column  of  Figure  5a  describes  the  four  input  patterns  that  were  used  in  the 
simulation.  These  input  patterns  are  labeled  A.  B.  C.  and  D.  Patterns  B.  C,  and  D  are 
all  subsets  of  A.  The  relationships  among  the  inputs  that  make  the  simulation  work  are  as 
follows: 

Code  Instability  Example 


D  C  C  C  A. 

(2) 

B  C  A. 

(3) 

Bf]C  ^0, 

(4) 

>|<|B|<iCi. 

(5) 

These  results  thus  provide  infinitely  many  examples  in  which  an  alphabet  of  just  four  input 
patterns  cannot  be  stably  coded  without  the  2/3  Rule.  The  numbers  1,  2,  3,  ...  listed 
in  the  second  column  itemize  the  presentation  order.  The  third  column,  labeled  BU  for 
Bottom-Up.  describes  the  input  pattern  that  was  presented  on  each  trial.  In  both  Figures 
5a  and  5b,  the  input  patterns  w'ere  periodically  presented  in  the  order  ABCAD. 

Each  of  the  Top-Down  Template  columns  in  Figure  5  corresponds  to  a  different  node 
in  F2,  with  column  1  corresponding  to  node  i>i.  column  2  corresponding  to  node  t;2,  and 
so  on.  Each  row  summarizes  the  network  response  to  its  input  pattern.  The  symbol  RES, 
which  stands  for  resonance,  designates  the  node  in  F2  which  codes  the  input  pattern  on 
that  trial.  For  example,  t>2  codes  pattern  C  on  trial  3,  and  t]  codes  pattern  B  on  trial 
7.  The  patterns  in  a  given  row  describe  the  templates  after  learning  has  occurred  on  that 
trial. 

In  Figure  5a.  in])ut  pattern  A  is  ]<eriodically  reccuU'd:  On  trial  1.  it  is  coded  by  v-^:  on 
trial  4.  it  is  coded  by  j'2-  trial  0.  it  is  coded  by  j'i;  on  trial  9,  it  is  coded  by  i>2.  This 
alternation  in  the  nodes  t'l  and  I2  which  code  pattern  A  repeats  indefinitely. 

Violation  of  the  2/3  Rule  occurs  on  trials  4.  6.  8.  9.  and  so  on.  This  violation  is 
illustrated  by  comparing  the  template  of  t'2  on  trials  3  and  4.  On  trial  3,  the  template  of 
i’2  is  coded  by  pattern  C.  which  is  a  subset  of  pattern  A.  On  trial  4,  pattern  A  is  presented 
and  directly  activates  node  V2-  Because  the  2/3  Rule  does  not  hold,  pattern  A  remains 
supraliminal  in  Fj  even  after  the  subset  template  (.'  is  read-out  from  v-j-  Thus  no  search 
is  elicited  by  the  mismatch  of  pattern  A  and  its  subset  template  C.  Consequently  the 
template  of  ih  is  recoded  from  pattern  C  to  its  superset  pattern  A. 
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Figure  5.  Stabilization  of  categorical  learning  by  the  2/3  Rule:  In  both  (a)  and  (b),  four 
input  patterns  A.  B,  C’,  and  D  arc  presented  rei>eat<'dly  in  the  list  order  ADCAD.  In  (a), 
the  2/3  Rule  is  violated  because  the  top-down  inhibitory  gain  control  mechanism  be  weak 
(Figure  4c).  Pattern  A  is  periodically  coded  by  ?>]  and  V2-  It  is  never  coded  by  a  single 
stable  category.  In  (b),  the  2/3  Rule  is  re.stored  by  strengthening  the  top-down  inhibitory 
gain  control  mechanism.  After  some  initial  recoding  during  the  first  two  presentations  of 
ABCAD.  all  patterns  directly  acce.ss  distinct  .stable  categories. 
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In  Figure  5b,  by  contrast,  the  2/3  Rule  does  hold  due  to  a  larger  choice  of  the  atten- 
tional  gain  control  parameter.  Thus  the  network  experiences  a  sequence  of  recodings  that 
ultimately  stabilizes.  In  particular,  on  trial  4.  node  v-j  reads-out  the  subset  template  C, 
which  mismatches  the  input  pattern  A.  The  numbers  beneath  the  template  symbols  in  row 
4  describe  the  order  of  search.  First,  i’2  s  template  C  mismatches  A.  Then  Ui’s  template 
B  mismatches  A.  Finally  A  activates  the  uncommitted  node  t;3,  which  resonates  with  Fj 
as  it  learns  the  template  A. 

Scanning  the  rows  of  Figure  5b.  we  see  that  pattern  A  is  coded  by  on  trial  1;  by  vg 
on  trials  4  and  6;  and  by  V4  on  trial  9.  On  all  future  trials,  input  pattern  A  is  coded  by 
V4.  Moreover,  all  the  input  patterns  A,  B.  C.  and  D  have  learned  a  stable  code  by  trial  9. 
Thus  the  code  selFstabilizes  by  the  second  run  through  the  input  list  ABCAD.  On  trials  11 
through  15,  and  on  all  future  trials,  each  input  pattern  chooses  a  different  node  [A  v^', 
B  v\\  C  — »  1^3;  D  i’2)-  Each  pattern  belongs  to  a  separate  category  because  the 
vigilance  parameter  was  chosen  to  be  large  in  this  example.  Moreover,  after  code  learning 
stabilizes,  each  input  pattern  directly  activates  its  node  in  F2  without  undergoing  any 
additional  search.  Thus  after  trial  9.  only  the  "RES"  symbol  appears  under  the  top-dowm 
templates.  The  patterns  showui  in  any  row  between  9  and  15  provide,  a  complete  description 
of  the  learned  code.  Examples  of  how  a  novel  exemplar  can  activate  a  previously  learned 
category  are  found  on  trials  2  and  5  in  Figures  5a  and  5b.  On  trial  2.  for  example,  pattern 
B  is  presented  for  the  first  time  and  directly  accesses  the  category  coded  by  Vj ,  which  was 
previously  learned  by  pattern  A  on  trial  1.  In  terminology  from  artificial  intelligence.  B 
activates  the  same  categorical  "pointer,”  or  “marker,”  or  “index”  as  in  A.  In  so  doing, 
B  does  not  change  the  categorical  “index,”  but  it  may  change  the  categorical  template, 
which  determines  which  input  parterns  will  also  be  coded  by  this  index  on  future  trials. 
The  category  does  not  change,  but  its  invariants  may  change. 

An  example  of  how  presentation  of  very  different  input  patterns  can  influence  the 
category  of  a  fixed  input  pattc  rn  is  found  through  consideration  of  trials  1,  4,  and  9  in 
Figure  5b.  These  are  the  trials  on  which  pattern  A  is  recoded  due  to  the  intervening 
occurrence  of  other  input  patterns.  On  trial  1,  pattern  A  is  coded  by  V].  On  trial  4,  A  is 
recoded  by  1^3  because  pattern  B  has  also  been  coded  by  U]  and  pattern  C  has  been  coded 
by  i-'2  in  the  interim.  On  trial  9,  pattern  A  is  recoded  by  V4  both  because  pattern  C  has 
been  recoded  by  U3  and  pattern  D  has  been  coded  by  i>2  in  the  interim. 

In  all  of  these  transitions,  the  global  structure  of  the  input  pattern  determines  which 
F2  nodes  will  be  activated,  and  global  measures  of  pattern  match  at  Fj  determine  whether 
these  nodes  will  be  reset  or  allowed  to  resonate  in  STM. 


8.  Vigilance,  Orienting,  and  Reset.  We  now  show  how  matching  within  the 
attentional  subsystem  at  Fj  determines  whether  or  not  the  orienting  subsystem  will  be 
activated,  thereby  leading  to  reset  of  the  attentional  subsystem  at  F2.  The  discussion  can 
be  broken  into  three  parts: 

A.  Distinguishing  Active  Mismatch  from  Passive  Inactivity 

A  severe  mismatch  at  Fj  activates  the  orienting  subsystem  A.  In  the  worst  possible 
case  of  mismatch,  none  of  the  F  j  nodes  can  satisfy  the  2/3  Rule,  and  thus  no  supraliminal 
activation  of  F 1  can  occur.  Thus  in  the  worst  ca.se  of  mismatch,  wherein  F 1  becomes  totally 
inactive,  the  orienting  subsystem  must  surely  be  engaged. 

On  the  other  hand,  Fj  may  be  inactive  simply  because  no  inputs  whatsoever  are  being 
processed.  In  this  case,  activation  of  the  orienting  subsystem  is  not  desired.  How  does  the 
network  compute  the  difference  between  active  mismatch  and  passive  inactivity  at  Fj? 

This  question  led  Grossberg  [4]  to  assume  that  the  bottom-up  input  source  activates 
two  parallel  channels  (Figure  2a).  The  attentional  subsystem  receives  a  specific  input 
pattern  at  Fi .  The  orienting  subsystem  receives  convergent  inputs  at  A  from  all  the  active 
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input  pathways.  Thus  the  orienting  subsy.stem  can  be  activated  only  when  Fj  is  actively 
processing  bottom-up  inputs. 

B.  Competition  between  the  Attentional  and  Orienting  Subsystems 

How,  then,  is  a  bottom-up  input  prevented  from  resetting  its  own  F2  code?  What 
mechanism  prevents  the  activation  of  A  by  the  bottom-up  input  from  always  resetting  the 
STM  representation  at  F2?  Clearly  inhibitory  pathways  must  exist  from  Fi  to  A  (Figure 
2a).  When  F]  is  sufficiently  active,  it  prevents  the  bottom-up  input  to  A  from  generating 
a  reset  signal  to  F2.  When  activity  at  Fj  is  attenuated  due  to  mismatch,  the  orienting 
subsystem  A  is  able  to  reset  F2  (Figure  2b,c,d).  In  this  way,  the  orienting  subsystem  can 
distinguish  betw'een  active  mismatch  and  passive  inactivity  at  Fj . 

Within  this  general  framework,  we  now  show  how  a  finer  analysis  of  network  dynamics, 
wfith  particular  emphasis  on  the  2/3  Rule,  leads  to  a  vigilance  mechanism  capable  of 
regulating  how  coarse  the  learned  categories  will  be. 

C.  Collapse  of  Bottom-Up  Activation  due  to  Template  Mismatch 

Suppose  that  a  bottom-up  input  pattern  has  activated  Fj  and  blocked  activation  of  A 
(Figure  2a).  Suppose,  moreover,  that  Fj  activates  an  F2  node  which  reads-out  a  template 
that  badly  mismatches  the  bottom-up  input  at  Fj  (Figure  2b).  Due  to  the  2/3  Rule,  many 
of  the  F]  nodes  which  w'ere  activated  by  the  bottom-up  input  alone  are  suppressed  by  the 
top-down  template.  Suppose  that  this  mismatch  event  causes  a  large  collapse  in  the  total 
activity  across  Fj,  and  thus  a  large  reduction  in  the  total  inhibition  which  Fj  delivers  to 
.4.  If  this  reduction  is  sufficiently  large,  then  the  excitatory  bottom-up  input  to  A  may 
succeed  in  generating  a  nonspecific  reset  signal  from  ^  to  F2  (Figure  2c). 

In  order  to  characterize  when  a  reset  signal  will  occur,  we  make  the  following  natural 
assumptions.  Suppose  that  an  input  pattern  I  sends  positive  signals  to  |  /  |  nodes  of  Fj. 
Since  every  active  input  pathway  projects  to  A,  I  generates  a  total  input  to  A  that  is 
proportional  to  |  /  |.  We  suppose  that  .4  reacts  linearly  to  the  total  input  ^  |  /  |.  We  also 
assume  that  each  active  Fi  node  generate^;  an  inhibitory  signal  of  fixed  size  to  A.  Since 
every  active  Fj  node  projects  to  A,  the  total  inhibitory  input  6  |  A'  |  from  Fj  to  A  is 
proportional  to  the  number  |  A'  |  of  active  Fj  nodes.  When  ')  |  /  |>  ^  |  X  |,  A  receives  a 
net  excitatory  signal  and  generates  a  nonspecific  reset  signal  to  F2  (Figure  2c). 

In  response  to  a  bottom-up  input  pattern  I  of  size  |  7  |,  as  in  Figure  2a,  the  total 
inhibitory  input  from  Fj  to  A  equals  6  |  /  |,  so  the  net  input  to  A  equals  (7  —  6)  |  /  |.  In 
order  to  prevent  A  from  firing  in  this  case  (Figure  2a),  we  assume  that  (5  >  7.  We  call 


the  vigilance  parameter  of  the  orienting  subsystem.  The  constraints  ^  ^  7  >  0  are  equiv¬ 
alent  to  0  <  p  <  1.  The  size  of  p  determines  the  proportion  of  the  input  pattern  which 
must  be  matched  in  order  to  prevent  reset. 

When  both  a  bottom-up  input  I  and  a  top-down  template  are  simultaneously 
active  (Figure  2b),  the  2/3  Rule  implies  that  the  total  inhibitory  signal  from  F]  to  A 

equals  ^  |  fl  /  |.  In  this  case,  the  orienting  subsystem  is  activated  only  if 

7  I  / 1>  <3 1  n  /  I;  (7) 

that  is,  if 


In  order  to  illustrate  how'  the  netwmrk  codifies  a  series  of  patterns,  we  show  in  Figure 
6  the  first  20  trials  of  a  simulation  using  alphabet  letters  as  input  patterns.  In  Figure  6a, 
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Figure  C.  Alphabet  learning:  Diflcrent  vigilance  levels  cause  difierent  numbers  of  letter 
categories  to  form. 
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the  vigilance  parameter  p  =  .5.  In  Figure  6b.  p  —  ,8.  Three  properties  are  notable  in 
these  simulations.  First,  choosing  a  different  vigilance  parameter  can  determine  different 
coding  histories,  such  that  higher  vigilance  induces  coding  into  finer  categories.  Second, 
the  network  modifies  its  search  order  on  each  trial  to  reflect  the  cumulative  effects  of  prior 
learning,  and  bypasses  the  orienting  system  to  directly  access  categories  after  learning  has 
taken  place.  Third,  the  templates  of  coarser  categories  tend  to  be  more  abstract  because 
they  must  approximately  match  a  larger  number  of  input  pattern  exemplars. 

Given  p  ~  .5.  the  network  groups  the  26  letter  patterns  into  8  stable  categories  within 
3  presentations.  In  this  simulation,  F2  contains  15  nodes.  Thus  7  nodes  remain  uncoded 
because  the  network  self-stabilizes  its  learning  after  satisfying  criteria  of  vigilance  and 
global  code  self-consistency.  Given  p  —  .S  and  15  F2  nodes,  the  network  groups  25  of  the 
26  letters  into  15  stable  categories  within  3  presentations.  The  26th  letter  is  rejected  by 
the  network  in  order  to  self-stabilize  its  learning  while  satisfying  its  criteria  of  vigilance  and 
global  code  self-consistency.  These  simulations  show  that  the  network's  use  of  processing 
resources  depends  upon  an  evolving  dynamical  organization  with  globally  context-sensitive 
properties.  This  class  of  networks  is  capable  of  organizing  arbitrary  sequences  of  arbitrarily 
complex  input  patterns  into  stable  categories  subject  to  the  constraints  of  vigilance,  global 
code  self-consistency,  and  number  of  nodes  in  Fj  and  F2- 

APPENDIX 

NETWORK  EQUATIONS 


STM  Equations 

The  STM  activity  of  any  node  in  Fi  or  Fb  obeys  a  membrane  equation  of  the  form 

=  -Axk  -f  (R  -  Cjk)J^  -  DxkJj^,  (.41) 

w'here  and  are  the  total  excitatory  input  and  total  inhibitory  input,  respectively, 
to  vi^  and  A,  B.  C.  D  are  nonnegative  parameters.  If  C  >  0,  then  the  STM  activity  xi^[t) 
remains  within  the  finite  interval  [0,  RC“*]  no  matter  how'  large  the  inputs  and  are 
chosen. 

We  denote  nodes  in  Fj  by  t,',i  where  i  —  1.2.  ..  .A/.  We  denote  nodes  in  F2  by  Vj, 
where  j  —  M  1,  M  +  2, N .  Thus  by  (Al), 

d 

=  -AiX,  -1-  (Ri  -  C]Xj).7~  -  DiX,J~  [A2) 

and 

^-Xj  —  —A2Xj  -h  {B2  —  C’2.r,j.7^  -  D2XjJ ^  .  (A3) 

The  input  is  a  sum  of  the  bottom-up  input  7,  and  the  top-down  template 

r,  =  y/(i,)i,„  (.44) 

3 

that  is, 

~  I,  -l- 1  ,.  (.45) 


52 


where  f{Xj)  is  the  signal  generated  by  activity  Xj  of  and  Zj,  is  the  LTM  trace  in  the 
pathway  from  Vj  to  Vi. 

The  inhibitory  input  J~  controls  the  attentional  gain: 

•^r  =  cy/(x,).  {A6) 

} 


Thus  Jj  =  0  if  and  only  if  F2  is  inactive  (Figure  4). 

The  inputs  and  parameters  of  STM  activities  in  F2  were  chosen  so  that  the  F2  node 
which  received  the  largest  input  from  Fj  wins  the  competition  for  STM  activity.  Theorems 
show  how  these  parameters  can  be  chosen  [15-17].  The  inputs  JJ  and  J~  have  the 
following  form. 

Input  adds  a  positive  feedback  signal  g{Xj)  from  t'j  to  itself  to  the  bottom-up 
adaptive  filter  input 

r,  =  {A7) 


that  is, 


Jj  -  g{xj)  +  Tj. 


[AS] 


where  h(xj)  is  the  signal  emitted  by  v^  and  z^j  is  the  LTM  trace  in  the  pathway  from  t',  to 
Vj.  Input  J~  adds  up  negative  feedback  signals  gixk)  from  all  the  other  nodes  in  F2: 


Such,  a  network  behaves  approximately  like  a  binary  switching  circuit: 

^  _  I  G  if  T,  >  m&x[T^.  ■  k  j) 

^  1 0  otherwise. 


(.49) 


{AlO) 


LTM  Equations 

The  LTM  trace  of  the  bottom-up  pathway  from  ij  to  ly  obeys  a  learning  equation  of 
the  form 

^Zjj  =  f(xj)[-HtjZ,j  -t  A'/2(i,)].  (.411) 

In  (All),  term  /(x,)  is  a  postsynaptic  sampling,  or  iFaniing.  signal  because  f(xj)  —  0 
implies  =  0.  Term  /(x,)  is  also  the  outpxxt  signal  of  r,  to  pathways  from  ij  to  Fj,  as 

in  (.44). 

The  LTM  trace  of  the  top-down  pathway  from  ij  to  u,  also  obeys  a  learning  equation 
of  the  form 

=  f(Xj)\-Hj,z^,  +  A'h(x,)].  (A12) 

In  the  present  simulations,  the  simplest  choice  of  was  made  for  the  top-down  LTM 
traces: 

Hj,  =  H  —  constant.  (A13) 
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A  more  complex  choice  of  Hji  was  made  for  the  bottom-up  LTM  traces.  This  was 
done  to  directly  generate  the  Weber  Law  Rule  [2]  via  the  bottom-up  LTM  process  itself. 
The  Weber  Law  Rule  can  also  be  generated  indirectly  by  exploiting  a  Weber  Law  property 
of  competitive  STM  interactions  across  Fj.  Such  an  indirect  instantiation  of  the  Weber 
Law  Rule  enjoys  several  advantages.  In  particular,  it  would  enable  us  to  also  choose 
Hji  —  H  =  constant.  Instead,  we  allowed  the  bottom-up  LTM  traces  at  each  node  Vj 
to  compete  among  themselves  for  synaptic  sites.  Malsburg  and  Willshaw  [18]  have  used 
a  related  idea  in  their  model  of  retinotectal  development.  In  the  present  usage,  it  was 
essential  to  choose  a  shunting  competition  to  generate  the  Weber  Law  Rule,  unlike  the 
Malsburg  and  Willshaw  usage.  Thus  we  let 


Hji  =  Lh(ii)  -I-  Y,  h(xk)-  (A14) 

A  physical  interpretation  of  this  choice  can  be  seen  by  rewriting  (All)  in  the  form 

k:^1 

By  (A15),  when  the  postsynaptic  signal  /(xy)  is  positive,  a  positive  presynaptic  signal 
h{xi)  commits  receptor  sites  to  the  LTM  process  Zij  at  a  rate  (A'  —  Lz^j)h[:^)f[xj).  Si¬ 
multaneously.  signals  h[xk),  k  ^  i.  which  reach  Vj  at  different  regions  of  the  ij  membrane 
compete  for  sites  which  are  already  committed  to  z,j  via  the  mass  action  competitive  terms 
—  Zijf(xj)h{Ti^].  When  equilibrate.s  to  these  compering  signals, 


.  =  .  .  (A16) 

{L~i)h{x,)  +  u-h{^ky  ^  ^ 

The  signal  function  h(w')  was  chosen  to  rise  quickly  from  0  to  1  at  a  threshold  activity 
level  wq.  Thus  if  ly  is  a  suprathreshold  node  in  Fj,  (A16)  approximates 

K 

“  (L  -  l]+  i  X  I 

where  j  X  |  is  the  numVjer  of  active  nodes  in  Fi .  Term  Zij  obeys  a  Weber  Law  Rule  if 
L>1. 

STM  Reset  System 

The  simplest  possible  mismatch-mediated  activation  of  .4  and  STM  reset  of  F2  by  A 
were  implemented  in  the  simulations.  As  outlined  in  .Secrion  3,  each  active  input  pathway 
sends  an  excitatory  signal  of  size  q  to  A.  Potentials  Xi  of  F ]  which  exceed  a  signal  threshold 
T  generate  an  inhibitory  signal  of  size  —6  to  A.  Population  A,  in  turn,  generates  a 
nonspecific  reset  wave  to  F2  whenever 

-  I  7  I  ^6  I  X  |>  0.  (A18) 

where  I  is  the  current  input  pattern  and  |  A"  \  is  the  number  of  nodes  across  F  j  such  that 
Xi  >  T.  The  nonspecific  reset  wave  shuts  off  the  active  F^  node  until  the  input  pattern  I 
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shuts  off.  Thus  (AlO)  must  be  modified  to  shut  off  all  F2  nodes  which  have  been  reset  by 
A  during  the  presentation  of  I. 
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ABSTRACT.  Neural  network  models  of  boundary  completion,  textural  segmentation, 
and  perceptual  grouping  possess  novel  mathematical  properties  with  implications  in  psy¬ 
chology,  neurobiology,  artificial  intelligence,  geometry,  statistical  mechanics,  and  decision 
theory.  These  neural  circuits  explain  color  and  boundary  data  as  well  as  textural  grouping 
phenomena.  Some  of  the  circuits  have  recently  received  experimental  support  from  neuro¬ 
physiological  recordings  from  monkey  visual  cortex.  The  circuits  suggest  a  new  approach 
to  the  design  of  computer  vision  systems. 

1.  Noiiliuear  Dynamical  Systems  in  Visual  Perception,  Neurobiology,  and  Arti¬ 
ficial  Intelligence.  The  venerable  subject  of  visual  perception  offers  the  modern  student 
of  nonlinear  dynamical  systems  a  wealth  of  new  phenomena  and  concepts  that  are  ripe  for 
development.  This  is  partly  because  the  profound  issues  of  visual  perception,  like  those 
of  number  theory,  are  often  revealed  through  the  use  of  immediately  accessible  materials. 
Just  as  many  of  the  deepest  questions  about  the  integers  can  be  presented  to  any  student, 
many  of  the  deepest  phenomena  concerning  visual  perception  can  be  readily  seen  by  a 
casual  observer  (Figure  1).  Easily  stated  number  theoretic  questions  have  led  to  some  of 
the  world’s  most  profound  mathematics.  Efforts  to  explain  visual  phenomena  have  also 
given  rise  to  increasingly  abstract  concepts  since  the  pioneering  work  of  Helmholtz,  Mach, 
and  Maxwell. 

The  perceptual  processing  theory  that  we  and  our  colleagues  are  now  developing  has  led 
to  the  discovery  of  new  dynamical  systems  models  of  nonlinear  competition,  cooperation, 
diffusion,  and  resonance.  These  models  have  been  used  to  quantitatively  simulate  on  the 
computer  many  visual  percepts  that  have  not  been  explained  by  previous  theories,  to  link 
these  percepts  to  recent  neurophysiological  and  anatomical  data  (Cohen  and  Grossberg, 
1984a,  1984b;  Grossberg,  1983a,  1983b,  1984;  Grossberg  and  Mingolla,  1985a,  1985b), 
and  to  make  some  neural  predictions  which  have  recently  been  confirmed  (Grossberg  and 
Mingolla,  1985a). 

As  befits  formal  theories  which  model  important  physical  phenomena,  these  dynami¬ 
cal  systems  exhibit  a  wide  variety  of  mathematical  properties  and  have  led  to  new  types 
of  theorems.  For  example,  certain  dynamical  systems  of  this  type  have  the  remarkable 
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property  that,  despite  their  very  high  dimension,  they  always  approach  one  of  perhaps 
infinitely  many  equilibrium  points  given  any  physical  choice  of  initial  values  and  param¬ 
eters  (Cohen  and  Grossberg,  1983;  Grossberg,  1978,  1980).  A  wide  variety  of  complex 
oscillations,  including  period  doubling  and  chaotic  oscillations,  are  known  to  be  solutions 
of  closely  related  nonlinear  dynamical  systems  (Carpenter  and  Grossberg,  1983,  1984, 
1985;  Cohen  and  Grossberg,  1983;  Grossberg,  1980).  Although  a  great  deal  is  now  known 
about  these  several  types  of  dynamical  behavior,  it  remains  to  completely  characterize  the 
mathematical  conditions  under  which  they  obtain.  In  addition,  such  dynamical  systems 
describe  a  new  type  of  dynamical  geometry.  Familiar  geometrical  objects  such  as  curves 
and  surfaces  are  shown  to  be  characterized  by  neural  processes  in  a  manner  that  is  radi¬ 
cally  different  from  the  axioms  of  geometry.  These  systems  also  illustrate  that  the  brain 
realizes  novel  principles  of  nonequilibrium  statistical  mechanics  and  nonstationary  decision 
theory.  New  circuit  designs  for  a  sophisticated  type  of  computer  vision  are  also  suggested. 
Thus  the  present  theory  defines  a  mathematical  framew'ork  in  which  new  mathematical 
ideas  about  perception,  neurobiology,  dynamical  systems,  geometry,  statistical  mechanics, 
decision  theory,  and  artificial  intelligence  are  developing  side-by-side. 

This  theory  has  emerged  from  an  analysis  of  how  brain  designs  achieve  informative 
visual  representations  of  the  external  world  that  are  much  more  veridical  than  the  retinal 
sensory  data  from  which  they  are  derived.  Of  special  interest  are  issues  concerning  how 
distributed  patterns  of  locally  ambiguous  visual  cues  can  be  used  to  generate  unambiguous 
global  percepts  through  the  mediation  of  nonlinear  dynamical  interactions.  The  present 
article  describes  some  of  the  issues  that  led  to  the  theory,  introduces  the  dynamical  systems 
themselves,  and  illustrates  some  of  their  behaviors  using  computer  simulations. 

2.  From  Noisy  Retina  to  Coherent  Percept.  The  limitations  of  sensory  information 
on  the  retina  present  a  formidable  challenge  to  any  perceptual  theory.  For  example,  light 
passes  through  a  thicket  of  retinal  veins  before  it  reaches  retinal  photoreceptors.  The 
percepts  of  human  observers  are  fortunately  not  distorted  by  their  retinal  veins  during 
normal  vision.  The  retinal  veins  are  not  perceived  because  of  the  action  of  mechanisms 
which  attenuate  the  perception  of  images  that  are  stabilized  with  respect  to  the  retina. 
Suppressing  the  percept  of  the  stabilized  veins  is,  however,  far  from  sufficient  to  generate 
an  unambiguous  percept.  This  is  because  the  images  that  reach  the  retina  are  often 
occluded  and  segmented  by  the  veins  in  several  places.  Even  a  single  edge  can  be  broken 
into  several  disjoint  components.  Somehow  in  the  final  percept  broken  retinal  contours  are 
completed,  and  occluded  retinal  color  and  brightness  signals  are  filled-in.  These  completed 
and  filled-in  percepts  are,  in  a  strict  mechanistic  sense,  illusory  percepts. 

Observers  are  not  aware  of  which  parts  of  a  perceived  edge  are  “real”  and  which  are 
“illusory.”  Thus  it  is  not  surprising  that  illusory  percepts  have  provided  important  clues 
for  deriving  the  present  theory,  and  that  the  theory  can  be  used  to  analyse  many  percepts 
of  illusory  boundaries,  colors,  and  brightnesses.  Of  special  interest  are  experiments  of 
Krauskopf  (1963)  and  Yarbus  (1967).  These  experiments  show  that,  if  certain  scenic  edges 
are  artificially  stabilized  with  respect  to  the  retina,  then  colors  and  brightnesses  which  were 
previously  bounded  by  these  edges  can  flow  across,  or  fill-in,  the  percept  until  they  are 
contained  by  the  next  perceptually  significant  boundary  (Figure  Ic).  Such  results  clarify 
how  the  visual  system  synthesized  boundaries,  both  “real”  and  “illusory,”  that  it  selects 
as  perceptually  significant,  and  how  featural  filling-in  occurs  within  these  boundaries. 

The  retinal  veins  are  not  the  only  imperfection  in  the  retinal  image.  The  visual  world 
is  typically  viewed  in  inhomogeneous  lighting  conditions.  The  scenic  luminances  that  reach 
the  retina  thus  confound  fluctuating  lighting  conditions  with  invariant  object  colors  and 
brightnesses.  It  has  been  known  since  the  time  of  Helmholtz  (1962)  that  the  brain  somehow 
discounts  the  spurious  illuminants  to  generate  color  and  brightness  percepts  that  are  more 
veridical  than  those  in  the  retinal  image.  Land  (1977)  has  shown  that  the  perceived 
colors  within  a  picture  constructed  from  overlapping  patches  of  color  are  determined  by 
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Figure  1:  Some  perceptual  illusions,  (a)  A  bright  illusory  disc  is  induced  perpendicular 
to  the  ends  of  the  radial  lines  (Kennedy,  1979).  (b)  A  bright  illusory  square  is  induced 
parallel  to  four  black  pac-man  figures  (Kanizsa,  1974).  (c)  When  the  edges  of  the  large 
circle  and  the  vertical  line  are  stabilized  on  the  retina,  the  red  color  (dots)  outside  the 
large  circle  envelopes  the  black  and  white  hemi-discs  except  within  the  small  red  circles 
whose  edges  are  not  stabilized  (Yarbus,  1967).  The  red  inside  the  circles  looks  brighter 
(right)  or  darker  (left)  than  the  enveloping  red. 


the  relative  contrasts  of  the  edges  between  successive  patches.  The  luminances  within  the 
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interiors  of  each  patch  are  somehow  discounted. 

This  type  of  result  also  points  to  the  action  of  a  filling-in  process.  Were  it  not  possible 
to  fill-in  among  non-discounted  edges,  we  would  perceive  a  world  of  boundaries  or  cartoons. 
Since  edges  are  used  to  generate  the  final  filled-in  percept,  a  good  theory  must  define  edge 
computations  in  a  way  that  can  achieve  this  goal. 

To  explain  how  this  is  accomplished,  the  present  theory  hypothesizes  that  at  least  two 
fundamentally  different  types  of  edge  processing  occur  in  parallel  during  human  visual 
perception.  As  in  the  Land  color  demonstrations,  edge  computations  in  the  present  theory 
are  used  to  generate  a  final  percept,  but  edge  computations  by  themselves  do  not  constitute 
the  final  percept.  These  operations  have  been  used  to  provide  a  physical  interpretation 
and  generalization  (Grossberg,  1984)  of  the  Land  retinex  theory  of  color  and  brightness 
perception  (Land,  1977). 

3.  The  Boundary  Contour  System  and  the  Feature  Contour  System.  The  theory 
asserts  that  two  distinct  types  of  edge,  or  contour,  computations  are  carried  out  within  two 
parallel  systems,  which  we  call  the  Boundary  Contour  System  and  the  Feature  Contour 
System  (Figure  2).  Boundary  Contour  signals  are  used  to  synthesize  the  boundaries  that 
the  visual  system  selects  as  perceptually  significant.  Feature  Contour  signals  initiate  the 
filling-in  processes  whereby  brightnesses  and  colors  spread  until  they  hit  their  first  bound¬ 
ary  contours,  or  are  attenuated  due  to  their  spatial  spread  if  no  boundary  contours  inter¬ 
vene.  The  Boundary  Contour  System  is  defined  by  a  nonlinear  cooperative-competitive 
dynamical  system  whose  inputs  are  derived  from  a  nonlinear  filter.  The  Feature  Con¬ 
tour  System  is  defined  by  a  nonlinear  diffusion  process  whose  inputs  are  derived  from  a 
nonlinear  filter  and  whose  diffusion  coefficients  are  controlled  by  the  Boundary  Contour 
System. 

These  two  types  of  contour  systems  have  not  been  identified  in  previous  perceptual 
theories.  One  reason  for  this  delay  is  that  each  scenic  edge  can  activate  both  systems. 
Only  the  net  effect  of  the  interation  between  systems  is  perceived.  Another  reason  is  that 
boundary  contours  are  not,  by  themselves,  visible.  They  become  visible  by  restricting  the 
filling- in  that  is  triggered  by  feature  contour  signals. 

If  boundary  contours  are,  by  themselves,  invisible,  then  how  can  we  be  sure  that  they 
exist?  This  is  accomplished  by  an  analysis  of  perceptual  data  which,  by  inference,  reveals 
the  different  rules  whereby  these  contours  are  computed.  The  main  rules  are  the  following 
ones. 


4.  Boundary  Contours  and  Boundary  Completion.  The  process  whereby  boundary 
contours  are  built  up  is  initiated  by  the  activation  of  oriented  masks,  or  elongated  receptive 
fields,  at  each  position  of  perceptual  space  (Hubei  and  Wiesel,  1977). 

A.  Orientation  and  Contrast:  The  output  signal  that  is  generated  by  an  oriented  mask 
to  the  next  processing  stage  is  sensitive  to  the  orientation  and  to  the  amount  of  contrast, 
but  not  to  the  direction  of  contrast,  at  an  edge  of  a  scene.  Thus  a  vertical  boundary 
contour  can  be  activated  by  either  a  close-to- vertical  light-dark  edge  or  a  close-to-vertical 
dark-light  edge  at  a  fixed  scenic  position. 

B.  Short-Range  Oriented  Shunting  Competition:  A  mask  of  fixed  orientation  excites 
the  like-oriented  cells  activated  at  its  location  and  inhibits  the  like-oriented  cells  activated 
at  nearby  locations  (stage  w  of  Figure  3a).  In  other  words,  an  on-center  off-surround 
organization  of  like-oriented  cell  interactions  exists  around  each  perceptual  location.  The 
off-surround  inhibition  is  of  shunting,  or  divisive,  type,  rather  than  being  subtractive,  to 
prevent  a  straight  scenic  edge  of  fixed  contrast  from  self- annihilating  its  own  percept. 

C.  Oriented  Tonic  Opponent  Processing:  At  each  perceptual  location,  the  cells  that 
react  to  perpendicularly  oriented  masks  compete  (stage  x  in  Figure  3a).  This  competition 
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Figure  2:  A  inacrocircuit  of  processing  stages:  The  functional  names  of  the  abbreviated 
stages  are:  MP^  =  Left  Monocular  Preprocessing  stage;  MPjj  =  Right  Monocular  Pre¬ 
processing  stage;  BCS  =  Boundary  Contour  Synthesis  stage;  MBC';^  =  Left-Monocular 
Brightness  and  Color  stage;  MBCjj  =  Right-Monocular  Brightness  and  Color  stage;  and 
BP  =  Binocular  Percept  stage.  Neural  interpretations  of  the  abbreviated  stages  have  been 
given  elsewhere  (Grossberg,  1984). 

defines  a  push-pull  opponent  process:  If  a  given  orientation  is  inhibited,  then  its  perpen¬ 
dicular  orientation  is  disinhibited.  Hence  the  opponent  processes  are  tonically  active. 

D.  Normalization  and  Ratio  Scale:  A  final  stage  of  orientational  competition  at  each 
position  tends  to  normalize,  or  adapt,  the  total  output  of  the  boundary  signals  corre¬ 
sponding  to  that  position  (stage  y  of  Figure  3a).  The  orientationally  tuned  suprathreshold 
outputs  corresponding  to  each  position  of  perceptual  space  thus  tend  to  be  ratios  of  a 
conserved  total  activity,  such  that  orthogonal  orientations  cannot  simultaneously  posssess 
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positive  ratios. 

E.  Long-Range  Oriented  Cooperation  and  Boundary  Completion:  Like-oriented  cells 
which  survive  the  competitive  interactions  (A)-(D),  and  which  have  approximately  aligned 
orientations  across  perceptual  space,  can  activate  an  oriented  cooperative  process  z  (Figure 
3a).  The  cooperative  process,  in  turn,  feeds  back  to  the  competitive  process  via  the  chain  of 
processes  z  v  ^  w.  The  feedback  exchange  between  these  competitive  and  cooperative 
interactions  is  capable  of  rapidly  synthesizing  sharp  boundaries,  both  real  and  illusory, 
and  of  rapidly  segmenting  a  scene.  The  dynamical  system  which  defines  the  Boundary 
Contour  System  is  now  described. 

Indices  (i,  j)  denote  position  in  a  two-dimensional  lattice  and  k  denotes  an  orientation. 
Let  Sij  equal  the  input  to  position  Divide  the  oriented  mask  with  position  {i,J)  and 

orientation  k  into  a  left-half  and  a  right-half  Rijk-  Then 

j  _  Wijk  ~  O^Vijk]~^  [^ijk  /I'l 

where 

Uiit  =  H  S„,  (2) 

Vijk=  E  (3) 

(p,q)eRijk 


and  the  notation  [p]'*'  =  max(p,0); 

“  ~^i}k  d"  fi^ijk)  ~  ‘^ijk  E;  f{^pqk)‘^pqiji 

(p.9) 


where 


where 


^pqtj 


lo 


A  if  (p  -  i)^  +  (?  -  jy  <  "^0 
otherwise; 


d 

dt 


'^I]k  ”  —U’ijk  +  I  A  Jijk  +  '^t]k  ~  '^ijk  E  '^pqkBpqij^ 

ip>q) 


tpqij 


\i[p-iy  +  {q-iy<BQ 

\  0  otherwise; 


Jl^ijk  =  A  C[wijk  -  WijKV , 

where  K  is  perpendicular  to  fc; 


~nyijk  —  ~Dyijk  -t-  (£^  —  yijk)^ijk  ~  yijk  E  ^i3m\ 


(4) 

(5) 

(6) 

(7) 

(8) 

(9) 


'Jf^ijk  =  -^tjk  +9(  E  iypqr  ~  ypqR^^pqij^)  4"  ^  (  E  [yp9r  ypqR]^Mij)^  (^9) 

(p,9,r)  (p.9.') 
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where  R  is  perpendicular  to  r,  and  the  cooperative  kernels  Fpq^^  and  are  defined  by 


~  cos(iVp,y  -  r)  \  f\cos{Npqij  -  k)f] 

and 

where  _ _ _ _ 

^pgij  —  y/iP  ~  0^  "t"  (9  ~ 


and 

^pqij  =  arctan^^  -  |) ‘ 

The  nonlinear  signal  functions  /  in  (4)  and  g  in  (10)  are  of  the  form 

/(e)  =  le  -  RV 


and 


siO  ~ 


[e]^ 

r+[^]+- 


(11) 

(12) 

(13) 

(14) 

(15) 

(16) 


Figure  lb  illustrates  the  relevance  of  some  of  these  rules  to  perceptual  phenomena.  In 
Figure  lb,  an  illusory  square  can  be  seen.  The  theory  suggests  that  the  vertical  illusory 
boundary  contours  are  completed  by  the  cooperative  process,  described  in  Section  4E,  in 
response  to  the  vertically  oriented  masks,  described  in  Section  4 A.  An  illusory  square  can 
also  be  seen  if  the  two  black  pac-man  figures  at  the  bottom  of  Figure  lb  are  replaced 
by  white  pac-man  figures,  and  the  white  background  is  replaced  by  a  grey  background. 
The  black  pac-man  figures  form  a  dark-light  edge  with  respect  to  the  grey  background. 
The  white  pac-man  figures  form  light-dark  edges  with  respect  to  the  grey  background. 
The  visibility  of  vertical  illusory  boundaries  shows  that  a  process  exists  that  is  capable  of 
completing  boundaries  between  edges  with  opposite  directions  of  contrast.  The  boundary 
completion  process  is  thus  sensitive  to  orientational  alignment  across  perceptual  space,  as 
in  Section  4E,  and  to  amount  of  contrast  but  not  to  direction  of  contrast,  as  in  Section 
4A. 


5.  Feature  Contours  and  Diffusive  Filling-In.  The  Feature  Contour  process  obeys 
different  rules  of  contrast  than  those  governing  the  Boundary  Contour  process. 

A.  Contrast:  The  feature  contour  process  is  sensitive  to  direction  of  contrast  as  well  as 
to  amount  of  contrast,  unlike  the  boundary  contour  process.  Thus  to  compute  the  relative 
brightness  across  a  scenic  boundary,  one  needs  to  keep  track  of  which  side  of  the  scenic 
boundary  has  a  larger  reflectance.  Direction  of  contrast  is  also  used  to  determine  which 
side  of  a  red-green  scenic  boundary  is  red  and  which  is  green.  The  different  sensitivities  of 
the  two  contour  systems  to  directiori  of  contrast  is  one  of  their  most  important  dissociating 
properties. 

The  Feature  Contour  process  also  obeys  different  rules  of  spatial  interactions  than 
those  governing  the  Boundary  Contour  process. 

B.  Diffusive  Filling~In:  Boundary  contours  activate  a  boundary  completion  process  that 
synthesizes  the  boundaries  which  define  monocular  perceptual  domains.  Feature  contours 
activate  a  diffusive  filling- in  process  that  spreads  featural  qualities,  such  as  brightness  or 
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INPUT  TO  ORIENTED 
COOPERATION 


FEATURE  CONTOUR 


SIGNALS 

Figure  3:  (a)  Boundary  Contour  completion:  An  on-center  off-surround  interaction  be¬ 
tween  like-oriented  cells  representing  nearby  positions  (stage  w)  is  followed  by  tonic  oppo¬ 
nent  processing  between  orthogonally  oriented  cells  at  each  position  (stage  x).  Then  the 
total  suprathreshold  activity  at  each  position  is  normalized  (stage  y)  before  each  orien¬ 
tation  feeds  into  long-range  cooperation  (stage  z)  among  similarly  oriented  and  spatially 
aligned  cells.  Positive  feedback  from  the  cooperative  process  feeds  back  to  like-oriented 
cells  at  the  opponent  processing  stage,  (b)  Feature  Contour  filling-in:  Feature  Contour 
signals  activate  cell  compartments  which  permit  rapid  lateral  diffusion  of  electrical  poten¬ 
tial  across  their  membranes,  except  at  those  membranes  which  receive  Boundary  Contour 
signals  from  the  BCS  stage  of  Figure  2. 
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color,  across  these  perceptual  domains,  as  in  the  Yarbus  (1967)  demonstration  (Figure  Ic). 
Figure  3b  depicts  the  main  properties  of  this  filling- in  process. 

It  is  assumed  that  featural  filling-in  occurs  within  a  array  of  closely  coupled  cell  com¬ 
partments.  A  feature  contour  input  signal  to  a  cell  of  the  array  triggers  a  rapid  diffusion  of 
electrical  potential  across  the  compartment  membranes  of  neighboring  cells.  This  diffusion 
spreads  with  a  space  constant  that  depends  upon  the  electrical  properties  of  both  the  cell 
interiors  zmd  their  membrzmes. 

A  Boundary  Contour  signal  is  assumed  to  decrease  the  diffusion  constant  of  its  target 
cell  membranes  within  the  cell  syncytium.  It  does  so  by  acting  as  an  inhibitory  gating 
signal  that  causes  an  increase  in  membrane  resistance.  At  the  same  time  that  a  boundary 
contour  signal  attenuates  the  filling-in  process  at  its  target  cells,  it  acts  to  inhibit  the 
potentials  of  these  cells.  The  nonlinear  diffusion  process  which  instantiates  properties 
(5 A)  and  (5B)  is  rigorously  defined  in  Cohen  and  Grossberg  (1984b). 

6.  Boundary-Feature  Trade-Off;  New  Axioms  of  Geometry.  The  theory’s  rules 
seem  natural  when  one  realizes  that  the  rules  of  each  contour  system  are  designed  to  offset 
insufficiencies  of  the  other  contour  system.  This  realization  also  leads  to  the  conclusion 
that  neural  representations  of  geometrical  objects  do  not  obey  the  axioms  of  geometry. 
The  Boundary  Contour  system,  by  itself,  could  at  best  generate  a  world  of  boundaries 
or  cartoons.  The  Feature  Contour  system,  by  itself,  could  at  best  generate  a  world  of 
formless  qualities.  Once  one  accepts  that  featural  filling- in  spreads  over  perceptually  am¬ 
biguous  regions  until  reaching  a  boundary  contour,  it  becomes  an  urgent  task  to  synthesize 
boundaries  capable  of  containing  the  featural  flow.  Orientationally  tuned  input  masks,  or 
receptive  fields,  are  needed  to  initiate  the  process  of  building  up  these  boundary  contours. 
Orientationally  tuned  input  masks  are,  however,  insensitive  to  orientation  at  the  ends  of 
lines  and  at  object  corners.  This  breakdown  is  illustrated  by  the  simulation  in  Figure  4a, 
which  depicts  the  reaction  of  a  lattice  of  orientationally  tuned  masks  to  a  thin  vertical 
line  (Grossberg  and  Mingolla,  1985a).  Without  further  processing  of  the  mask  outputs, 
featural  quality  could  flow  freely  out  of  every  line  end  or  corner.  Such  a  flow  does,  in  fact, 
occasionally  occur  in  vivo  in  response  to  certain  scenes,  as  in  neon  color  spreading  (Redies 
and  Spillmann,  1981;  van  Tuijl,  1975),  or  in  the  featural  flow  that  occurs  over  retinally 
stabilized  scenic  edges  (Krauskopf,  1963;  Yarbus,  1967). 

To  offset  this  difficulty  under  normal  circumstances,  we  suggest  that  the  boundary 
system  initiates  the  several  stages  of  competitive  interaction  described  in  Section  4  to 
compensate  for  orientational  insensitivity  at  line  ends  and  at  corners.  Figure  4  shows  how 
these  competitive  interactions  generate  horizontal  boundary  signals  at  the  end  of  the  thin 
vertical  line  that  help  to  prevent  the  flow  of  featural  quality  out  of  the  line.  Such  boundary 
signals  are  said  to  be  generated  by  end  cutting,  or  orthogonal  induction.  Thus  every  line 
end  is  an  illusory  percept  that  is  reconstructed  at  a  high  level  of  visual  processing. 

The  output  pattern  of  the  competitive  process  triggers,  in  its  turn,  a  long-range  ori¬ 
ented  cooperative  process,  which  builds  up  the  completed  boundary  contours.  Computer 
simulations  (Figure  5)  show  that  this  process  is  capable  of  quickly  building  sharp  bound¬ 
aries  that  span  widely  separated  input  masks  while  suppressing  spurious  noise.  Figure 
5  illustrates  that  the  process  which  synthesizes  a  percept  of  line  may  not  even  form  a 
connected  set  until  the  system  approaches  equilibrium.  Thus  a  perceived  line  is,  in  part, 
an  equilibrium  solution  of  a  mixed  cooperative-competitive  nonlinear  feedback  network, 
rather  than  a  connected  set  of  points. 

The  illusory  circle  in  Figure  la  Ccin  now  be  analysed  as  a  result  of  orthogonal  end 
cutting  followed  by  oriented  boundary  cooperation.  Many  previously  mysterious  percepts 
have  been  analysed  as  consequences  of  the  fundamental  Boundary-Feature  Trade-Off.  All 
of  these  analyses  suggest  that  the  way  in  which  we  perceive  geometrical  objects  does  not 
correspond  well  to  the  classical  axioms  of  geometry. 
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Figure  4;  Boundary-Feature  Trade-Off:  (a)  Response  of  an  orientation  field  to  a  thin 
vertical  line  which  looks  blown  up  relative  to  receptive  field  sizes.  Lengths  and  orientations 
of  lines  encode  relative  amounts  of  activation  and  orientations  of  the  oriented  masks  at  the 
corresponding  bar  positions.  Orientational  tuning  breaks  down  at  the  end  of  the  bar.  (b) 
End-cutting  generates  “illusory”  horizontal  activations  at  the  end  of  the  line  in  response 
to  the  orientation  field  in  (a),  to  offset  the  breakdown  in  orientational  tuning.  The  eml- 
cutting  pattern  is  the  output  of  stage  y  of  Figure  3a.  Thus  “every  line  end  is  illusory,”  in 
sharp  contrast  to  the  axioms  of  geometry. 
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Figure  5:  Each  column  depicts  a  different  time  during  the  boundary  completion  pro¬ 
cess.  The  original  input  is  two  noisy  but  vertically  biased  inducing  points  and  a  horizontal 
intervening  noise  element.  The  cooperative-competitive  exchange  triggers  transient  or¬ 
thogonal  inductions  before  attenuating  all  nonvertical  elements  as  it  completes  the  vertical 
boundary. 
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7.  Textural  Segmentation  and  Grouping.  While  the  importance  of  the  Boundary 
Contour  System  is  illustrated  by  its  ability  to  complete  individual  contours,  whether  “real” 
or  “illusory,”  its  rules  also  appear  to  account  for  much  of  the  segmentation  of  textured 
scenes  into  grouped  regions  separated  by  perceived  contours,  many  of  which  have  no  ob¬ 
vious  correlates  in  the  optical  input.  A  contour  in  a  pattern  of  luminances  is  generally 
defined  as  a  spatial  discontinuity  in  luminance.  While  usually  sufficient,  however,  such 
discontinuities  are  by  no  means  necessary  for  sustaining  perceived  contours.  Regions  sep¬ 
arated  by  visual  contours  also  occur  in  the  presence  of  statistical  differences  in  textural 
qualities  such  as  orientation,  shape,  density,  or  color  (Beck,  1966a,  1966b.  1972,  1982, 
1983;  Beck,  Prazdny,  and  Rosenfeld,  1983;  CaelH,  1982.  1983;  Caelli  and  Julesz,  1979). 
Two  findings  of  textural  grouping  research  are  especially  salient.  First,  the  visual  system’s 
segmentation  of  the  scenic  input  occurs  rapidly  throughout  all  regions  of  that  input,  in  a 
manner  often  described  as  “preattentive.”  That  is,  subjects  generally  describe  boundaries 
in  a  consistent  manner  when  exposure  times  are  short  (under  200  msec)  and  without  prior 
knowledge  of  the  regions  in  a  display  at  which  boundaries  are  likely  to  occur.  Thus  any 
theoretical  account  of  boundary  extraction  for  such  displays  must  explain  how  early  “data 
driven”  processes  rapidly  converge  on  boundaries  wherever  they  occur. 

The  second  finding  of  the  experimental  work  on  textures  complicates  the  implications 
of  the  first,  however:  the  textural  segmentation  process  is  exquisitely  context-sensitive. 
That  is,  a  given  texture  element  at  a  given  location  can  be.  part  of  a  variety  of  larger 
groupings,  depending  on  what  surrounds  it.  Indeed,  the  precise  determination  even  of 
what  acts  as  an  element  at  a  given  location  can  depend  on  patterns  at  nearby  locations. 
We  suggest  that  a  boundary  completion  process  underlies  the  context-sensitive  ability  of 
the  visual  system  to  segment  and  group  textured  input. 

A  long  line  of  distinguished  research  by  Jacob  Beck  and  his  colleagues  has  identified 
variables  affecting  textural  segmentation  by  the  human  visual  system  (Beck,  1983;  Beck, 
Prazdny,  and  Rosenfeld,  1983).  Simple  displays  like  the  ones  shown  in  Figure  6  show 
that  the  slopes  of  small  elements  of  color  or  brightness  contrast  are  a  critical  determinant 
of  grouping,  with  regions  containing  many  features  with  similar  slopes  tending  to  group. 
In  particular,  if  certain  of  these  features  are  distributed  in  a  regiilar  manner,  colinear 
groupings  of  these  features  can  become  “emergent  features,”  capable  of  setting  one  textural 
region  apart  from  another.  A  crucial  aspect  of  such  emergent  features  is  that  the  colinear 
arrangement  need  not  be  in  line  with  the  directions  of  the  local  contrasts.  (See  Figure  6). 

Computer  simulations  illustrate  the  ability  of  the  Boundary  Contour  System  to  gen¬ 
erate  perceptual  groupings  akin  to  those  seen  in  Figure  6.  Numerical  parameters  were 
held  fixed  for  all  of  the  simulations;  only  the  input  patterns  were  varied.  As  the  input 
patterns  were  moved  about,  the  Boundary  Contour  System  sensed  relationships  among 
the  inducing  elements  and  generated  emergent  boundary  groupings  among  them.  In  all  of 
the  simulations,  we  defined  the  innut  patterns  to  be  the  output  patterns  of  the  oriented 
receptive  fields,  as  in  Figure  4a,  since  our  primary  objective  was  to  study  the  cooperative- 
competitive  feedback  exchange.  All  possible  oriented  groupings  generated  inputs  to  the 
cooperative-competitive  feedback  process.  Only  the  favored  groupings  survived.  Thus  the 
ability  of  the  network  to  suppress  the  many  incorrect  local  groupings  is  as  important  as 
its  ability  to  choose  the  correct  global  grouping. 

Figure  7a  depicts  an  array  of  four  vertically  oriented  input  clusters.  We  call  each 
cluster  a  Line  because  it  represents  a  caricature  of  how'  a  field  of  oriented  complex  cells 
resond  to  a  vertical  line.  Figure  7b  displays  the  equilibrium  activities  of  the  cells  at  the 
second  competitive  stage  of  our  model.  The  length  of  an  oriented  line  at  each  position  is 
proportional  to  the  equilibrium  activity  of  a  cell  whose  receptive  field  is  centered  at  that 
position  with  that  orientation.  The  input  pattern  in  Figure  7a  possesses  a  manifest  vertical 
symmetry;  Pairs  of  vertical  Lines  are  colinear  in  the  vertical  direction,  whereas  they  are 
spatially  out-of-phase  in  the  horizontal  direction.  The  Boundary  Contour  System  senses 
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Figure  6:  (a)  Emergent  features.  The  colinear  linking  of  short  line  segments  into  longer 
segments  is  an  “emergent  feature”  which  sustains  textural  grouping.  Our  theory  explains 
how  such  emergent  features  can  contribute  to  perceptual  grouping  even  if  they  are  not 
visible  (Gellatly,  1980).  (Reprinted  from  Beck,  Prazdny,  and  Rosenfeld,  1983.)  (b)  The 
diagonal  grouping  at  the  top  of  this  figure  is  initiated  by  differential  activation  of  diagonally 
oriented  receptive  fields,  despite  the  absence  of  any  diagonal  edges  in  the  image.  Horizontal 
cooperation  of  signals  at  the  ends  of  vertical  lines  generates  subjective  contours  in  the 
bottom  half  of  this  figure.  (Adapted  from  Beck,  Prazdny.  and  Rosenfeld,  1983.) 
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Figure  7 :  Computer  simulations  of  processes  underlying  textural  grouping:  The  length 
of  each  line  segment  is  proportional  to  the  activation  of  a  network  node  responsive  to  one 
of  twelve  possible  orientations.  The  dots  indicate  the  positions  of  inactive  cells.  Parts 
(<^)>  l^))  and  .(g)  display  the  activities  of  oriented  cells  which  input  to  the  cooperative- 
competitive  feedback  process.  Parts  (b),  (d),  (f),  and  (h)  display  equilibrium  activities 
of  oriented  cells  at  the  competitive  stage  of  the  Boundary  Contour  System.  A  pairwis(' 
comparison  of  (a)  with  (b),  (c),  and  (d),  and  so  on  indicates  the  major  groupings  senses 
by  the  network. 
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this  vertical  symmetry,  and  generates  emergent  vertical  lines  in  Figure  7b.  In  addition, 
the  Boundary  Contour  System  generates  to  horizontal  end  cuts  at  the  ends  of  each  Line, 
which  can  trap  the  featural  contrasts  of  each  line  within  the  Feature  Contour  System. 

In  Figure  7c  the  input  Lines  are  moved  so  that  pairs  of  Lines  are  colinear  in  the  vertical 
direction  and  their  Line  ends  are  lined  up  in  the  horizontal  direction.  Now  both  vertical 
and  horizontal  groupings  are  generated  in  Figure  7d.  In  Figure  7e  the  input  lines  are 
shifted  so  that  they  become  non-colinear  in  a  vertical  direction,  but  pairs  of  their  Line 
ends  remain  aligned.  The  vertical  symmetry  of  Figure  7c  is  hereby  broken.  Thus  in  Figure 
7f  the  Boundary  Contour  System  groups  the  horizontal  Line  ends,  but  not  the  vertical 
Lines. 

Figure  7h  depicts  a  more  demanding  phenomenon:  the  emergence  of  diagonal  groupings 
where  no  diagonals  exist  in  the  input  pattern.  Figure  7g  is  generated  by  bringing  the  two 
horizontal  rows  of  vertical  Lines  closer  together  until  their  ends  lie  within  the  spatial 
bandwidth  of  the  cooperative  interaction.  Figure  7h  shows  that  the  Boundary  Contour 
System  senses  diagonal  groupings  of  the  Lines.  These  diagonal  groupings  emerge  on  both 
microscopic  and  macroscopic  scales.  Thus  diagonally  oriented  receptive  fields  are  activated 
in  the  emergent  boundaries,  and  these  activations,  as  a  whole,  group  into  diagonal  bands. 

These  figures  illustrate  that  the  Boundary  Contour  System  behaves  like  an  on-line  sta¬ 
tistical  decision  theory,  sensing  only  those  groupings  of  perceptual  elements  with  enough 
“statistical  inertia”  to  drive  its  cooperative-competitive  feedback  exchanges  towards  a  non¬ 
zero  stable  equilibrium  configuration.  One  can  interpret  the  distribution  of  oriented  ac¬ 
tivities  at  each  input  position  as  being  analogous  to  a  local  probability  distribution,  and 
the  final  Boundary  Contour  System  pattern  as  being  the  global  decision  that  the  system 
reaches  and  stores  based  upon  all  of  its  local  data.  In  contrast  to  stochastic  relaxation  al¬ 
gorithms  for  boundary  detection  (Geman  and  Geman,  1984)  wherein  a  formal  temperature 
parameter  is  slowly  decreased  to  drive  the  system  towards  a  minimal  energy  configuration 
with  boundary  enhancing  properties,  no  Boundary  Contour  System  parameter  is  manip¬ 
ulated  by  an  external  agent.  The  Boundary  Contour  System  system  internally  regulates 
its  own  convergence  to  a  coherent  configuration  via  its  cooperative-competitive  feedback 
exchanges.  The  input  patterns  themselves  are  the  only  “external  parameters”  that  are 
altered  in  our  system.  Changing  the  external  input  pattern  can  cause  a  global  switch,  or 
phase  transition,  in  the  network,  as  in  Figure  7.  In  each  new  mode,  the  network  can  main¬ 
tain  a  different  coherent  organization  via  its  cooperative-competitive  feedback  loops.  The 
network  can  sustain  a  large  number  of  different  coherent  configurations  using  its  nonlinear 
dissipative  dynamics  (Prigogine,  1978),  rather  than  a  conservative  Hamiltonian  system. 

The  present  simulations  were  generated  by  solving  large  systems  of  nonlinear  differen¬ 
tial  equations  on  a  traditional  computer.  These  simulations  suggest  that  many  seemingly 
esoteric  Gestalt  rules  for  grouping  unambiguous  wholes  from  ambiguous  parts  can  be  ef¬ 
fected  by  networks  of  cells  embodying  a  small  number  of  Boundary  Contour  System  stages. 
The  local  intelligence  required  to  implement  these  network  designs  is  far  less  than  that  re¬ 
quired  of  a  traditional  computer  central  processing  unit.  The  intelligence  of  the  networks 
comes  from  their  embodiment  of  nonclassical  geometrical  properties  that  are  inherently 
context-sensitive.  An  appropriate  hardware  implementation  of  the  Boundary  Contour  Sys¬ 
tem  would  segment  and  group  complex  and  noisy  images  in  a  context-sensitive  manner 
and  in  real-time. 


8.  A  Synthesis  of  Nonlinear  Dynamics,  Parallel  Computation,  and  Image  Pro¬ 
cessing.  The  deepest  conceptual  issues  raised  by  the  present  results  concern  the  choice 
of  perceptual  units  and  design  principles.  Local  computations  of  scenic  elements  cannot 
provide  an  adequate  understanding  of  visual  perception,  if  only  because  most  luminance 
changes  are  discounted  as  spurious  by  the  human  visual  system.  The  Boundary-Feature 
Trade-Off  shows  that  the  visual  system  is  designed  in  a  way  that  is  quite  different  from  any 
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possible  local  computational  theory.  A  fertile  source  of  new  ideas  about  parallel  computa¬ 
tion,  image  processing,  geometry,  and  statistical  mechanics,  no  less  than  about  perceptual 
and  neural  theory,  can  thus  be  found  in  the  collective  properties  of  these  very  large  systems 
of  nonlinear  differential  equations. 
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I.  INTRODUCTION 


A.  UnrkgfQumL. 

Determining  the  vulnerability  of  combat  vehicles  and  olhcr  systems  to  the  wide  range  of 
threats  presented  in  modern  warfare  can  be  a  daunting  task.  Although  physics,  engineering, 
operations  research,  and  other  disciplines  offer  an  invaluable  framework  for  addre.ssing  (he 
problem,  these  morc>or*le6S  hard  sciences  cannot  by  themselves  provide  complete  soluUo[i.s. 
On  the  contrary,  practitioners  of  vulnerability  analysis  will  assure  you  that  their  profession  is  at 
least  as  much  an  admixture  of  art,  intuition,  and  educated  guesswork  as  a  science. 

Traditional  computer  techniques,  although  useful,  have  been  unable  to  master  the  entire 
problem.  It  was  to  address  this  problem  that  the  artificial  intelligence  (AI)  community 
developed  the  technology  that  has  come  to  be  called  expert  tj/elem$.  A  true  expert  system  is  a 
program  that  mimics  the  performance  of  a  human  expert  in  some  intellectual  endeavor.  The 
archetypal  expert  system  attains  this  high  level  of  proficiency  by  embodying  the  heuristic,  infor* 
maliy  framed  knowledge  of  the  human  expert,  along  with  the  expert's  oot>always  rigorous 
methods  of  reasoning  in  the  subject  domain. 

We  are  building  expert  systems  to  deal  with  vulnerability  analysis.  Toward  this  end,  wc 
have  developed  a  gener^-purpose  mechanism  (called  an  inference  engine  in  Al  Icrminotog)) 
with  which  to  emulate  the  behavior  of  human  experts.  The  project,  developed  in  Frani  LISP 
on  a  VAX-11/750,  is  called  Genie,  and  this  paper  offers  a  presentation  of  its  design,  its  imple¬ 
mentation,  and  its  usage. 

As  our  first  application  of  Genie,  wc  are  working  with  Walter  Thompson  and  Steven 
Polyak  of  the  Aerial  Targets  Branch,  VLD,  on  an  expert  system  to  assist  human  experts  in 
assessing  the  vulnerability  of  turbine  jet  engines.  Many  of  the  examples  cited  in  this  paper  can 
be  tliought  of  as  being  taken  from  such  a  system,  although  authenticity  of  domain  details  will 
sometimes  suffer  for  the  sake  of  illustrative  clarity. 

D.  Typical  Architecture  of  an  Bxnert  System. 

The  highest  level  of  organiralion  of  an  expert  system  is  not  complex,  as  figure  1  illus¬ 
trates.  A  basic  tenet  of  the  expert  system  methodology  requires  separation  of  domain  expertise 
(whether  it  be  in  infectious  diseases  or  turbine  engine  vulnerability)  from  the  strategics  for 
manipulating  and  applying  that  expertise.  The  resultant  modularity  greatly  facilitates  debugging 
and  modifying  either  subsystem. 


Fk;uri:i  -  Sg.-lcm  Overview 

The  fir^t  of  (he  stibsysb*iii.s  is  c:dled  (he  knmvledge  It  is  (he  m:i'>s  of  expertise  (hat 

has  been  rolirrted  from  one  or  several  hiiiiian  rx|>erl.s.  Tlie  knowledge  l»;ise  will  iypii::Jly  con¬ 
tain  both  dcclar.alivc  kintwleilge  (e.g.  (hat  the  specifie  gravily  of  steel  is  7.K)  and  proeedurid 
knowledge  (c.g.  that  a  means  of  deteriiiiiiiiig  the  s|s'cifir  gravity  of  an  arlnlrary  solid  is  to 
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calculate  the  ratio  of  the  solidus  density  to  the  density  of  water).  Procedural  knowledge  is  often 
expressed  in  rules  of  propositional  implication  (e.g.  that  if  a  solid  is  known  to  be  roughly  as 
dense  as  steel  then  it  can  be  deduced  that  the  solid’s  specific  gravity  is  approximately  equal  to 
that  of  steel).  The  process  of  constructing  a  knowledge  base,  of  determining  how  an  ex|>ert 
solves  problems  and  implementing  that  knowledge  on  a  computer,  is  called  knowledge  engineer¬ 
ing. 

The  second  subsystem  is  the  inference  engine.  In  a  sense,  the  inference  engine  is  the  pro¬ 
gram  that  acts  upon  the  knowledge  base  as  data.  Optimally,  it  would  be  so  well  separated  from 
the  knowledge  base  and  so  general  that  one  could  plug  in  a  different  knowledge  base  and 
thereby  create  anew  expert  system  in  an  unrelated  domain. 

Though  it  might  not  be  as  fundamental,  an  interface  between  the  system  and  its  users  is 
DO  less  important  than  the  knowledge  base  or  the  inference  engine.  The  user  interface  should 
allow  a  non-programmer  to  modify  the  knowledge  base,  obtain  consultative  assistance,  and 
interrogate  the  system  about  the  reasons  for  its  behavior,  all  in  language  that  is  clear  and 
natural  to  the  user. 


II.  REPRESENTATION  OF  KNOWLEDGE 

A  key  subprobiem  in  modeling  the  behavior  of  a  human  expert  is  to  choose  appropriate 
formats  for  expressing  and  storing  the  expert's  knowledge  so  that  a  computer  can  process  the 
knowledge  effectively  and  efficiently.  In  this  section  we  discuss  the  approach  to  knowledge 
representation  that  has  been  employed  in  building  Genie.  The  discussion  is  presented  in  iv^o 
parts:  first  we  consider  the  syntax  of  our  representation  scheme,  the  formats  available  for  use; 
and  then  we  consider  their  semantic  content,  what  mental  constructs  these  structures  represent 

A.  Tools 

1.  Frames.  The  data  structure  that  is  most  widely  used  by  Genie  is  called  a  frame. 
Frames  are  a  method  of  organizing  information  about  an  object  and  its  properties,  together  with 
pointers  to  other  frames  describing  related  objects.  A  frame  is  composed  of  slots,  each  of 
which  contains  a  piece  of  information.  Figure  2  illustrates  the  kinds  of  information  that  might 
be  stored  in  a  frame. 


Comp.T58 

A  Kind  Of 

compressor 

Type 

axial 

Manufacturer 

General  Electric 

Pound  In 

T58  turbine  engine 

Pressure  Ratio  * 

8.3  :  1 

Air  Mass  Flow 

5.6  kg/s 

Blade  Material 

steel 

Exit  Pressure 

calculate;  intake  pressure 
times  pressure  ratio 

Figure  2.— Typical  Frame 
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or  course,  if  frames  are  to  be  used,  then  special  functions  will  be  required  to  manipulate 
them.  We  have  implemented  a  set  of  routines  to  retrieve  and  store  information,  and  to  per¬ 
form  other  such  tasks  in  the  manner  described  by  Winston  and  Horn*  and  by  Roberts  and 
Goldstein®. 

2.  Value  Ranees.  Arithmetic  manipulation  of  data  is  a  central  theme  of  computer  sci¬ 
ence.  Expert  systems  do  not  manipulate  numbers  as  often  as  they  do  more  general  symbols, 
but  a  means  to  perform  numeric  computation  is  unquestionably  among  Genie's  requirements. 
Unfortunately,  the  kinds  of  numeric  knowledge  which  Genie  must  handle  carry  the  same 
imprecision  and  uncertainty  that  first  motivated  the  development  of  expert  systems.  There  are 
in  the  literature  many  approaches  to  dealing  with  the  problem  of  imprecise  and  uncertain 
knowledge,  some  of  which  will  be  discussed  in  section  VIII  below.  Another  representation 
problem  can  turn  up  even  with  knowledge  that  is  perfectly  precise,  crisp,  and  certain.  Some 
knowledge  is  inescapably  cast  as  information  about  sets,  vectors,  or  intervals,  rather  than 
scalars. 


We  have  developed  a  representation  format  to  deal  with  both  of  these  problems.  Using 
what  we  call  value  range$  and  the  routines  for  manipulating  them,  one  can  represent  either 
intervals  on  the  real  line  or  imprecisely  known  scalars.  A  value  range  record  consists  of  six 
cells  which  can  be  labeled  as  in  figure  3. 


< 


Figure  3. — Formal  of  a  Value  Range  Record 


Either  of  the  first  two  cells  can  be  used  for  a  lower  bound  on  the  value.  Similarly,  the  last  two 
cells  can  bound  the  value  above.  The  ^  cell  can  contain  a  set  of  prohibited  values.  The  a^cell 
can  only  hold  one  value;  if  it  is  non-empty,  then  all  the  other  cells  must  be  empty.  As  an 
example,  the  value  range  in  the  proposition  that 

0<x  <10,  I  note  \j,  6,  7j  (1) 

would  be  represented  by  the  record 


0 

(— .  6,  7) 

10 

12  *  ’  / 

Figure  4. — Inequality  (1)  Represented  as  a  Value  Range 


One  way  that  Genie  uses  value  ranges  is  in  representing  rules  in  the  knowledge  base.  For 
example,  a  knowledge  base  might  contain  the  following  rule, 

"If  thrust  of  engine  <  6000  lbs,  or 
thrust  of  engine  >  13,000  lbs, 
then  engine  is  not  J57." 

Another  use  for  value  ranges  is  Genie’s  facility  for  interpreting  a  user’s  answers  to  its 


*  P.  H.  Winston  »od  B.  K.  P.  Honi-  LISP.  Addisoo-Wosley,  1081.  pp.  201-301. 

2 

R.  B.  Roberta  And  1.  P.  Goldstein.  Tht  FRL  Prtmtr  Memo  No.  400,  ArtificiaJ  Intelligence  Laboratory, 
MIT.  1077. 
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questions.  In  answer  to  the  question,  “What  is  the  thrust  of  engine?**  the  user  may  enter, 
“5000  thrust  <  7500“,  There  are  several  functions  that  operate  on  value  ranj^es,  includ¬ 
ing  routines  to  build  a  record,  to  determine  if  a  hypothesized  value  conflicts  with  existing 
knowledge,  and  to  improve  the  precision  of  the  system's  estimate  of  a  value. 

B.  S^maatjci 

For  the  most  part,  Genie  manipulates  knowledge  that  is  expressed  in  propositional  impli¬ 
cations  known  as  production  rules,  or  productions.  A  production  links  one  group  of  propositions, 
called  its  antecedents,  to  a  second  group  of  propositions,  called  conclusions,  The  individual  pro¬ 
positions  are  represented  by  fact  frames,  and  the  productions  by  rule  frames.  A  third  type  of 
frame  used  by  Genie  is  the  concept  frame.  All  three  frame  types  have  two  states.  The  sialic 
state  consists  of  intrinsic  information  and  relations;  it  is  run-time  invariant.  The  dynamic  state 
adds  the  results  of  all  the  manipulations  performed  during  the  current  execution. 

1*  tlules-  In  practice,  a  rule  frame  often  contains  just  a  list  of  preconditions  for  applying 
the  rule  together  with  a  list  of  deductions  that  result  from  its  application.  These  are  stored  in 
the  Ifi  and  Thens  slots,  respectively.  Since  a  proposition  and  its  negation  describe  the  same 
knowledge,  we  use  one  fact  frame  to  represent  both.  Therefore,  propositions  appearing  in  rules 
must  be  marked  to  specify  which  sense  should  be  used.  As  an  example,  consider  ruleG,  whose 
frame  appears  in  figure  5a.  It  states  that  if  factlQ  and  factCO  are  known  to  be  true  and  fact3  is 
known  to  be  false  then  fact21  is  true  and  can  be  added  to  the  knowledge  base. 

Once  the  three  preconditions  are  met  and  the  deduction  is  made,  nileO  takes  the  form 
given  in  figure  5b,  In  this  case,  ruleG  is  said  to  have  fired.  If,  however,  fact20  were  determined 
to  be  false,  the  rule  would  not  fire  and  nothing  would  be  learned  about  fact2r5  truth  value. 
This  situation  is  illustrated  in  figure  5c, 

Flexibility  in  specifying  a  rule's  preconditions  is  provided  by  the  must-have  mechanism 
shown  in  figure  5d.  The  ruIeSS  frame  contains  three  antecedents.  The  Muat^have  slot  indi¬ 
cates  that  if  any  two  of  the  antecedents  can  be  determined  to  hold,  then  the  rule  can  fire.  The 
default,  for  rules  without  Must-^have  slots,  requires  all  the  preconditions  to  be  met. 

2.  Facts.  The  fact  is  the  basic  semantic  building  block  in  a  Genie  knowledge  base.  Rules 
are  built  up  from  them,  and  the  inference  engine  attempts  to  deduce  or  verify  them.  When  the 
user  is  asked  a  question,  it  is  in  order  to  acquire  new  facts.  And  the  system's  ultimate  answer 
is  some  fact.  The  internal  representation  of  facts  is  illustrated  in  figure  6. 

The  frame  factlT  (figure  6a)  represents  a  proposition  about  a  compressor.  The  English- 
language  statement  of^that  proposition  is  found  in  the  Stmt  slot.  The  Default  slot  is  available 
to  specify  which  sense  of  a  proposition  (viz.  its  affirmation  or  its  negation)  should  be  assumed 
in  the  event  that  its  truth  value  cannot  be  determined  directly.  The  slot  lists  all  the  rules 

in  the  knowledge  base  whose  application  depends  upon  the  truth  value  of  factl?.  In  particular, 
factl7  is  in  the  Ift  slots  of  rules  five,  twenty-seven,  and  twenty-eight.  The  Thena-of  slot,  on 
the  other  hand,  lists  all  the  rules  that,  if  applied,  will  determine  factl7's  truth  value:  ractl7  is 
in  the  Thens  slots  of  rules  16  and  105.  This  linking  of  facts  to  the  rules  that  use  them  speeds 
execution  and  facilitates  explaining  system  behavior  to  the  user. 

The  Xor  slot  of  faetl7  contains  the  name  of  a  group  of  mutually  exclusive  propositions. 
If,  by  whatever  means,  factl?  is  determined  to  be  true,  then  Genie  will  conclude  that  all  the 
other  facts  in  xor3  are  false.  This  feature  is  a  handy  way  to  model  hierarchies  of  disjoint  classes 
of  objects  (e.g.  Whether  a  target  is  of  RMA,  mild  steel,  or  aluminum).  It  also  provides  a 
means  of  representing  the  relation  between  antonyms,  as  for  example,  whether  compressor 
vane  geometry  is  variable  or  fixed. 

Suppose  Genie  was  running  and  applied  rulelG.  As  a  result,  it  would  amass  dynamic 
knowledge  about  the  truth  value  of  various  facts.  If  rulelG's  assertion  about  factl?  were  in  the 
affirmative,  then  factl7  would  be  modified  to  the  form  shown  in  figure  6b,  The  How  slot 
could  be  checked  at  a  later  time  to  determine  the  context  in  which  factl7’s  truth  was  deter¬ 
mined. 
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Figure  S. — Sample  Rule  Frames, 

(a)  The  etatic  version  of  a  simple  rule. 

(b)  The  dynamic  version  of  a  successful  application  of{d). 

(c)  A  dynamic  version  of  a  failed  application  (?/(a). 

(d)  A  mustr^have  rule. 


Propositions  concerning  numeric  knowledge  can  be  represented  using  what  we  call  arith¬ 
metic  facts,  as  illustrated  in  figure  6c.  The  Arithmetic  slot  of  an  arithmetic  fact  frame  has  three 
sub-cells  (called  facets  in  frame  terminology),  which  provide  the  links  by  which  to  confirm  or 
disprove  ract23.  According  to  the  concept  and  atb*  facets,  fact23  concerns  the  attribute  called 
speed  of  an  object  called  spool.  Cenie  determined  that  fact23  was  false  by  looking  in  the  spool 
frame  to  compare  what  it  knew  about  spool  speed  with  the  relation  stored  in  the  relat  facet  of 
fact23- 

3.  Concepts.  Rule  and  fact  frames  provide  a  significant  increase  in  deductive  speed,  but 
they  are  little  more  than  an  extension  of  the  production-rule  approach  to  building  expert  sys¬ 
tems.  One  of  the  major  advantages  of  production  rules  over  other  semantic  structures  is  their 
modularity.  Rules  represent  small  pieces  of  knowledge  and  can  be  added  to  a  knowledge  base 
or  modified  easily,  and  undesired  side-effects  are  less  common  than  with  more  complex 
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Figures. — Sample  Fad  Frames. 

(a)  The  static  version  of  a  simple  fact,  (b)  A  dynamic  version  o/(a). 
(c)  A  dynamic  Version  of  an  arithmetic  fact. 


structures*  Unfortunately,  though,  the  modularity  of  production  rules  also  represents  one  of 
their  most  serious  drawbacks*  Because  a  knowledge  base  made  up  of  rules  has  such  a  fine 
granularity,  it  is  difficult  to  ascertain  high*level  patterns  and  order  in  the  knowledge.  This 
problem  is  especially  serious  when  a  user  tries  to  understand  the  system’s  lines  of  reasoning,  or 
when  a  student  tries  to  acquire  the  expertise  inherent  in  the  knowledge  base.^ 

As  a  further  step  towards  representing  the  order  in  an  expert’s  knowledge,  Genie  uses 
concept  frames  to  group  information.  In  the  discussion  of  arithmetic  f;icts,  above,  we  saw  one 

- f  . . —  ■  - 

A.  Barr  *nd  E.  A.  Feigenbaum,  eds.  HanJiook  0/  ArUfUiol  Inttllijtnce,  vol.  1.  Williim  Kaurmann, 

Inc.,  1081.  pp.  leS-lOf 
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circumstance  in  which  concept  frames  are  used.  Proceeding  with  that  example,  we  shall  trace 
Genie’s  process  of  determining  the  truth  value  of  fact23.  The  system’s  knowledge  about 
several  of  the  spool’s  attributes  are  stored  in  spool,  which  might  take  the  form  shown  in  figure 
7. 
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facMO  factOS 
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0  -  500 

used-by 

fact43  ract44  factCO  factOO 

asked 

Yes 

Figure?, — SampU  Concept  FVame. 


Originally,  there  was  insufricient  information  in  the  speed  slot  of  spool  to  determine  whether 
fact23  held.  So  Genie  asked  the  user,  ‘*What  is  the  speed  of  spool?*'  The  user's  response  was 
presumably  the  scalar  value  9590,  since  that  is  what  the  value-range  facet  indicates.  When  the 
=  cell  of  a  value-range  is  non-empty,  its  contents  is  also  stored  in  the  value  facet. 

Neither  factlO  nor  fact05  has  been  needed  yet,  since  the  length  slot  retains  its  static  con¬ 
figuration.  But  one  of  the  four  propositions  about  spool  mass  was  needed,  since  the  asked 
facet  of  mass  is  full.  If  one  of  the  three  remaining  facts  asserted  that  spool  mass  equaled  200, 
then  Genie  would  be  incapable  of  determining  that  fact's  truth  value  by  direct  calculation.  This 
is  because  Genie  will  only  request  input  of  a  given  parameter  once,  on  the  assumption  that  the 
user  will  have  given  his  best  estimate  immediately. 


III.  REASONING  METHODS 

The  basic  intent  in  developing  expert  systems  is  to  enable  a  computer  to  reason  as  a 
human  expert  does.  By  this  we  mean  to  achieve  expert  performance^  since  our  goal  is  not  so 
much  modeling  the  expert’s  behavior  as  modeling  the  results  of  that  behavior.  A  parallel  can 
be  drawn  with  the  expert  system's  representation  of  knowledge:  one  seeks  a  format  that  is  iso¬ 
morphic  to  the  one  used  by  the  expert's  brain,  but  replication  is  neither  necessary  nor  possible. 
For  example,  we  do  not  assert  that  frames  or  value-range  records  exist  in  the  human  brain. 
But  neither  are  silicon  chips  identical  to  cortical  neurons*  In  the  same  way,  while  formal  logic 
is  seldom  applied  by  humans  to  real-world  problems,  its  spirit  can  be  fundamentally  useful  in 
developing  expert  systems. 

Given  a  body  of  formal  assertions  and  implications,  or  propositions  and  production  rules, 
there  are  basically  two  possible  strategies  for  using  them.  One  stri^lcgy  focuses  on  the  rules’ 
antecedents,  the  other  on  their  conclusions.  They  are  called  forward  and  backward  chaining ^ 
respectively,  and  will  be  explained  below. 

A.  ForwaT:,(^.Chaipmg 

In  forward  chaining,  one  compares  the  facts  in  the  knowledge  base  with  the  antecedents 
of  the  various  rules,  trying  to  fire  any  rules  possible.  When  a  rule  fires,  its  conclusions  are 
added  to  the  knowledge  base  and  can  potentially  trigger  other  rules.  Because  attention  is 
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focused  OD  matching  antecedents  against  the  facts  that  are  known,  this  method  is  also  called 
data-driven  reasoning. 

Forward  chaining  is  often  extremely  useful  in  real*time  applications.  In  these  settings,  the 
rules  often  represent  event/response  or  condition/action  knowledge.  Robotics  and  industrial 
process  control  are  examples. 

D.  p^Tckward  Chaining 

The  second  strategy  is  somewhat  more  complicated.  In  backward  chaining  one  starts  by 
considering  a  goal,  in  this  case  a  fact  whose  truth  value  is  desired.  The  key  step  is  to  find  a 
way  of  determining  the  truth  value,  which  usually  means  finding  a  rule  that  draws  a  conclusion 
about  the  fact.  If  such  a  rule  exists,  then  one  can  reformulate  the  original  problem  as  the 
determination  of  the  truth  values  of  each  of  the  rulers  antecedents.  This  reformulation  of 
problems  into  subproblems  is  iterated  until  each  of  the  subproblems  can  be  solved  directly, 
either  by  finding  facts  in  the  knowledge  base,  or  by  asking  questions  of  the  user.  For  this  rea¬ 
son,  backward  chaining  is  also  called  goal-driven  reasoning. 

C.  Genie's  Approach 

Reasoning  in  the  current  version  of  Genie  has  a  dual  nature.  From  a  holistic  viewpoint, 
the  control  strategy  is  entirely  goal-driven.  Equally  valid,  however,  is  the  reductionist  point  of 
view  that  all  the  system deductions  are  made  in  data-driven  mode.  The  two  arguments  are 
presented  here. 

A  Genie  knowledge  base  must  contain  at  least  one  fact  tagged  as  a  hypothesis,  or  top-level 
goal.  The  inference  engine  records  all  the  hypotheses  and  tries  to  verify  each  one  in  succes¬ 
sion.  Given  a  hypothesis,  Genie  looks  at  the  Thena-of  slot  in  its  fact  frame.  This  provides  a 
list  of  rules  that  could  potentially  confirm  the  hypothesis.  These  rules’  Ifa  slots  specify  other 
facts  which  Genie  treats  as  subgoals.  The  subgoals  will  generally  have  additional  rules  in  their 
own  Theni-of  slots,  and  so  on.  Following  all  these  Hiens^of  and  Ifi  links  as  far  as  they  lead 
would  generate  a  tree  structure  with  facts  as  nodes  and  rules  as  edges.  The  leaves  of  this  tree 
—  facts  that  cannot  be  deduced  from  any  rule  in  the  knowledge  base  —  constitute  the  informal 
tion  Genie  must  request  from  the  user,  and  are  consequently  the  simplest  possible  subgoats. 

Consider  the  lowest  level  of  deduction  in  this  process:  a  rule  whose  antecedents  are  all 
leaves.  A  question  will  be  asked  for  each  fact,  the  user's  answers  being  added  to  the  knowledge 
base  in  a  process  called  memorization.  Assuming  that  the  conditions  specified  by  the  rule's 
antecedents  conform  to  the  answers,  these  subgoals  are  all  achieved.  So  the  rule  fires,  causing 
its  conclusions  to  be  memorized,  and  a  larger  fraction  of  the  problem  has  been  solved.  The 
procedure  continues  in  this  fashion. 

Not  all  the  rules  will  fire,  of  course.  A  rule  fails  if  its  antecedents  do  not  conform  to  the 
circumstances  of  the  present  run.  When  this  happens,  the  desired  conclusion  must  be  achieved 
through  other  means.  If  the  fact  has  untried  rules  in  its  Thena^of,  they  will  be  tried.  If  not, 
the  fact  frame  will  be  searched  for  a  Default  If  all  the  rules  that  might  confirm  a  hypothesis 
fail,  then  the  hypothesis  is  discarded,  and  another  one  tried. 

Whereas  backward  chaining  imposes  order  on  Genie’s  performance,  it  is  forward  chaining 
that  actually  deduces  facts.  Every  time  a  fact  is  memorized,  whether  it  was  given  by  the  user  or 
deduced  from  some  rule,  forward  chaining  is  performed  on  it.  The  fact’s  slot  lists  the 

rules  that  require  it.  Each  one  of  these  rules  that  has  not  already  fired  or  failed  is  considered. 
If  any  rule's  antecedent  requires  the  wrong  truth  value  for  the  fact,  then  the  rule  fails.  How¬ 
ever,  if  the  newly  memorized  fact  conforms  with  the  only  unfulfilled  precondition  of  a  rule, 
then  that  rule  will  fire.  Genie  will  then  forward  chain  on  each  of  the  rule’s  conclusions  in  turn. 

TJie  double  linking  of  facts  and  rules  makes  Genie’s  knowledge  representation  scheme 
very  flexible.  Backward  chaining  is  achieved  by  following  the  Thena-^of  and  Ifa  links.  Simi¬ 
larly,  following  Ifb^of  and  Thcna  links  accomplishes  forward  chaining.  Thus,  one  representa¬ 
tion  of  a  rule  can  be  used  for  both  strategies.  This  capability  is  a  very  important  one,  since 
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using  only  one  or  the  other  strategy  has  serious  drawbacks.  A  system  that  only  forNvard  chains 
often  seems  to  behave  erratically,  jumping  around  the  knowledge  base.  Furthermore,  a  purely 
data-driven  system  cannot  even  ask  the  user  any  questions,  since  it  can  only  passively  obtain 
facts  and  apply  rules  to  them.  On  the  other  hand,  a  strictly  backward  chaining  system  draws 
only  those  conclusions  that  are  of  immediate  use.  So  it  will  miss  drawing  conclusions  that  are 
supported  by  its  knowledge  but  do  not  lie  in  the  direct  path  of  its  current  task. 

In  summary,  backward  chaining  causes  Genie’s  reasoning  to  be  directed  from  the  broad¬ 
est,  most  general  conclusions,  through  ever  more  specific  facts.  This  top-down  behavior 
creates  the  impression  that  the  system  is  acting  purposefully.  Within  that  context,  forward 
chaining  ensures  that  deductions  are  made  as  soon  as  the  necessary  knowledge  is  acquired. 


IV.  USER  INTERFACE 

The  two  previous  sections  discussed  Genie’s  layout  and  performance  from  an  internal 
viewpoint  Here  we  shall  describe  the  face  that  Genie  shows  to  humans  who  interact  with  it  at 
a  computer  terminal.  First  we  consider  the  outermost  layer  of  the  program  that  mediates  com¬ 
munication  between  the  user  and  Genie.  Then  the  specific  modes  of  giving  input  to  the  system 
are  addressed.  Finally,  we  discuss  Genie’s  ability  to  use  in  its  communication  something 
approaching  normal  English  grammar. 

A*  Driver 

Upon  invoking  Genie,  one  interacts  with  a  function,  called  Driver,  that  directs  Genie’s 
operation.  Anyone  who  invokes  Driver  will  be  categorized  into  one  of  three  access  classes: 
user,  rule-writer,  or  programmer.  The  user  class  is  the  broadest.  It  includes  those  who  use  the 
system  for  production  runs.  Driver  protects  general  users  and  Genie  from  one  another,  by  res¬ 
tricting  both  the  commands  the  user  may  use  and  the  actions  Genic  may  perform  for  the  user 
The  rule-writer  class  is  intended  to  include  the  knowledge  engineers  who  maintain  the  system. 
A  rule  writer  may  execute  any  of  the  commands  available  to  the  general  user.  In  addition,  the 
rule  writer  may  modify  the  knowledge  base  and  dig  more  deeply  into  Genic’s  internal  mechan¬ 
isms  to  determine  what  the  system  is  doing  and  why.  The  programmer  class  is  the  least  restric¬ 
tive.  Programmers  may  execute  any  of  the  commands  available  to  the  rule  writer  plus  several 
lower-level  commands  to  debug  the  inference  engine  and  peripheral  components. 

Driver  understands  a  number  of  commands,  the  most  important  of  which  is  run,  the 
request  to  start  up  th^  expert  system.  One  can  also  instruct  Driver  to  show  all  the  rules  that 
have  been  applied,  or  all  the  facts  or  numerical  values  that  have  been  derived  so  far  in  the 
current  run.  It  is  also  possible  to  view  rules  and  facts,  either  in  an  English  form  or  in  a  frame 
form  that  is  more  like  their  internal  representations.  Concept  frames  can  be  displayed  in  simi¬ 
lar  fashion. 

B.  Run-Time  Input  Functions 

When  Genie  requires  information  from  the  user,  there  are  three  schemes  by  which  it  can 
request  it.  The  first  and  simplest  of  these  is  the  simple  yes/no  question.  The  user  may 
respond  to  a  yes/no  question  by  typing  one  of  a  number  of  responses,  each  of  which  is 
equivalent  to  one  of  the  responses  “yes”,  “no”,  and  "unknown”.  The  three  canonical 
responses  have  the  effect  of  declaring  the  fact  (as  asked)  to  be  true,  false,  or  indeterminate, 
respectively. 

The  second  scheme  for  requesting  input  is  the  menu.  A  menu  displays  a  question  and 
several  numbered  responses.  Each  of  the  responses  represents  a  fact,  and  all  the  facts  in  a 
given  iiicnu  form  a  mutually  exclusive  set  (i.e.  they  are  in  an  xor  list).  The  user  simply  types  in 
the  number  of  the  desired  response.  Genie  then  concludes  that  the  chosen  fact  is  true  and  that 
all  others  in  the  menu  are  false. 
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The  third  mode  of  input  is  the  numeric  question.  Whenever  Genie  requires  an  arithmetic 
fact,  rather  than  simply  request  verification  of  that  fact,  it  asks  for  the  numeric  value  itself. 
The  user  m^  respond  to  a  numeric  question  in  a  fairly  flexible  algebraic  notation.  Possible 
responses  include  "5”,  "x  >  3.14",  and  "0  <**«  x  <  100".  Once  some  value>range  has  been 
stored  for  the  numeric  value,  all  the  facts  that  require  the  value  are  checked  to  determine  their 
truths.  It  is  assumed  that  the  user’s  input  was  the  best  estimate  he  had  of  the  requested  value. 

In  addition  to  the  permissible  answers,  all  three  question  inodes  also  allow  immediate 
interrogation  about  the  context  in  which  the  question  is  being  asked.  Depending  on  one’s 
access  class,  one  may  type  “why”,  to  determine  why  the  requested  information  is  needed, 
"rule",  to  be  shown  the  rule  that  is  currently  being  tested,  and  "hyp",  to  be  shown  the 
hypothesis  currently  under  consideration. 

C.  Grammar 

Genie  performs  all  its  reasoning  on  coded  representations  of  the  domain  information  in 
the  knowledge  base.  Generally,  when  it  speaks  to  the  user  it  must  translate  rules  and  other 
structures  into  English.  To  do  this  Genie  depends  on  a  simple  yet  effective  pseudo-English 
grammar.  This  grammar  is  used  in  compiling  the  knowledge  base,  and  then  again  whenever 
pieces  of  knowledge  are  displayed  to  the  user. 

The  primary  piece  of  knowledge  that  must  be  formulated  into  English  is  the  fact.  Since  a 
proposition’s  assertion  and  its  negation  represent  the  same  knowledge  —  in  the  sense  that  the 
truth  value  of  either  one  follows  immediately  from  that  of  the  other  —  a  single  fact  frame  is 
used  to  represent  both.  Each  fact  frame  contains  a  Stmt  slot,  the  foundation  of  the  formula¬ 
tion  process,  in  which  is  stored  an  affirmative  English-language  statement  of  the  proposition. 

So,  both  for  building  fact  frames  and  for  displaying  knowledge  and  asking  questions,  the 
grammatical  requirements  are;  the  means  to  determine  the  sense  of  a  statement  (affirmative  or 
negative),  to  switch  the  sense  of  a  statement,  and  to  turn  the  statement  into  a  question.  Genie 
accomplishes  these  tasks  through  a  simple  scheme  that  matches  statements  against  grammatical 
patterns.  Given  the  fact  statement,  “Compressor  has  driver  rings",  Genie  will  create  the  ques¬ 
tion,  "Does  compressor  have  driver  rings?".  It  can  complement  the  fact  statement,  “Engine  is 
fully  encased”,  yielding  “Engine  is  not  fully  encased." 


V.  DATA  DIGESTION 

It  has  been  a  fundamental  design  goal  that  Genie  should  be  able  to  perform  all  of  its  tran¬ 
sactions  with  humans  in  nearly  natural  language.  When  a  new  rule  is  added  to  a  Genie 
knowledge  base,  the  first  form  that  it  takes  is  an  English-language  statement.  But  because 
Genie  cannot  actually  reason  using  natural  language  directly,  there  is  little  justification  for  using 
English  strings  as  the  medium  of  knowledge  representation  within  the  system.  The  natural- 
language  components  of  Genie  must  be  able  to  switch  between  the  internal  representation  and 
statements  in  English. 

Strong  arguments  against  storing  facts  as  strings  of  English  words  can  be  made  from  con¬ 
siderations  of  efficiency.  First,  it  is  redundant  and  wasteful  of  space  to  store  a  lengthy  state¬ 
ment  of  a  fact  in  every  rule  that  contains  the  fact.  But  more  importantly,  representing  the 
abstract  object  known  as  a  fact,  with  all  its  semantic  and  contextual  baggage,  as  a  mere  string  of 
words  is  unacceptably  limiting.  Both  goal-  and  datardriven  reasoning  require  the  pairing  up  of 
antecedents  and  conclusions  of  various  rules.  To  do  this  with  the  scanty  rule  representation 
that  provides  only  word  strings  requires  sequentially  checking  every  rule  in  the  knowledge  base, 
performing  a  time-consuming  word-for-word  check  to  see  if  facts  match.  Negated  facts  cause 
additional  headaches. 

For  these  reasons,  Genie  includes  a  module  that  preprocesses  its  knowledge  base.  Inter¬ 
nally,  rules  and  facts  are  represented  by  frames  and  referred  to  by  unique  names  called  code 


85 


gyinbolB  (like  **tule24*’  and  *7act7”).  The  key  function  in  the  data  digestion  process  is  called 
build-fra77}e9.  It  reads  the  knowledge-base  input  file  and  puts  all  the  static  knowledge  into 
Genic’s  internal  representation  formats.  As  an  example,  the  rule  frame  shown  in  figure  5a 
might  have  been  created  by  build^frames  from  the  input  in  figure  8. 

(IF  (there  are  struts  at  the  front  of  the  engine) 

(there  are  two  flanges  near  the  front  of  the  engine) 

I  flanges  are  not  extremely  close  together) 

THEN 

(engine  has  a  front  frame)) 

Figure  s. — Sample  Input  Rule. 


The  data  digestion  process  creates  rule,  fact,  and  concept  frames  where  necessary,  record¬ 
ing  them  in  tables  so  that,  for  instance,  occurrences  of  a  given  fact  in  subsequent  rules  will  be 
referred  to  by  the  same  fact  code.  So  the  only  searching  that  Genie  must  do  is  in  translating 
English  statements  into  code  symbols,  and  this  is  done  all  at  once  during  data  digestion.  Other 
steps  in  data  digestion  are  adding  the  links  between  rule  and  fact  frames  to  allow  chaining,  and 
linking  all  the  facts  in  each  xor  list.  Appendix  C  provides  a  specification  of  the  structures  per¬ 
missible  in  the  knowledge-base  input  file. 

When  an  end  user  encounters  Genie,  the  knowledge  base  has  already  been  preproccssed. 
The  improvement  in  performance  obtained  through  data  digestion  is  marked.  In  a  pure  produc¬ 
tion  system  the  mean  time  required  to  make  one  inference  grows  linearly  as  the  size  of  the 
knowledge  base  is  increased.  This  is  so  because  backward  chaining  cannot  be  performed 
without  searching  the  entire  knowledge  base  for  relevant  rules.  Digesting  the  rules  as  is  done 
in  Genie  speeds  the  process  considerably,  and  the  time  per  inference  is  independent  of  the  size 
of  the  knowledge  base.^ 


VI.  EXPLANATION  FACILITIES 

A  common  observation  among  those  who  have  designed  expert  systems  is  that  a  system 
must  be  accessible  or  it  will  not  be  used,  because  few  will  feel  confident  accepting  the  output  of 
a  black  box.  The  program  should  be  able  to  provide  information  about  its  reasoning  and  justify 
particular  inferences  when  the  user  requests  it  to  do  so.^  Genie  has  several  facilities  to  provide 
this  kind  of  explanation  of  its  behavior  to  the  user. 

The  simplest  mechanism  available  to  the  user  is  ihe  straight-forward  dump  of  current 
knowledge.  This  can  take  any  of  the  following  forms:  listing  all  the  facts  whose  Truths  are 
known,  listing  all  the  rules  whose  Statuses  are  known,  and  listing  all  the  concept  attributes  for 
which  some  value-grange  is  known.  A  related  mechanism  is  the  ’’find”  command,  which  finds 
all  facts  whose  Stmts  contain  specified  words. 

The  “show”  command  can  be  used  to  display  a  rule  or  a  fact  —  either  in  English  or 
frame  form  —  or  a  concept  frame.  This  is  often  useful  in  combination  with  "find”. 

The  highest-level  interrogation  commands  currently  available  to  the  user  arc  “how”  and 
“why”.  An  example  of  the  former  is  "how  factl7”,  meaning,  “By  what  means  did  you  deter¬ 
mine  the  truth  value  of  facU7?”.  To  answer  the  question,  Genie  checks  the  How  slot  in  the 
fact  frame,  and  prints  an  explanation  of  its  justification  for  concluding  the  TVuth  of  the  fact. 


^  K  Niija,  K.  Saaiki,  »Dd  H.  Ihar*.  "Au  ExpcrimenUl  ComparisoD  of  Knowledge  RepresenUtion  Schemes". 
AI  Magatint,  Summer  1084.  pp.  20-36. 

^  B.  G.  Duebanm  and  E.  H.  Shortliffe,  eds.  Ruit-Dond  Etptri  SytUmi.  Addison-Wesley,  1084.  pp.  S8-S0. 
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The  **why"  command  asks  the  question,  *'To  what  end  did  you  need  that  fact?’*  It  can  be  used 
to  determine  Genie’s  motivation  for  asking  a  question.  In  response,  Genie  displays  the  rule 
that  it  is  currently  attempting  to  fire. 

Taken  together,  these  commands  allow  one  to  discover  the  system’s  lines  of  reasoning. 
This  is  useful  for  the  rule  writer  in  ensuring  that  rules  interact  to  produce  the  intended  conclu* 
sions.  It  is  also  useful  for  the  end  user  in  deciding  whether  to  accept  the  system’s  conclusions 
and  the  system  itself. 


VII.  FUTURE  WORK 

There  are  several  changes  underway  to  improve  both  Genie’s  internal  processes  and  its 
man/macbine  interface.  Included  in  these  are  the  system’s  interfaces  with  the  rule  writer  and 
the  user,  uncertainty,  higher-level  organization  of  knowledge,  more  natural  user  interface. 

A. 

There  are  many  kinds  of  uncertainty  inherent  in  any  real-world  problem.  The  omnipres¬ 
ence  of  UDcertainfy  and  inexactitude  is  made  even  more  bothersome  by  their  intractability. 
Reasoning  effectively  in  the  face  of  these  obstacles  is  among  the  most  challenging  problems  in 
AI. 

Among  the  types  of  uncertainty  often  encountered  are  simple  probability  of  a  proposition 
(e.g.  “There  is  a  75  percent  probability  that  the  fragment  will  perforate  the  combustor  hous¬ 
ing.”),  fuzziness  of  a  proposition  (e.g.  “The  power  turbine  is  very  rugged.”),  rule  strength  (i.e. 
the  extent  to  which  the  rule  is  applicable),  and  rule  reliability  (when  knowledge  is  synthesized 
from  several  experts). 

We  have  considered  adopting  the  certainty  factor  technique  used  in  the  MYCIN  project. 
A  certainty  factor  is  a  scalar  that  is  associated  with  a  proposition  and  reflects  the  proposition’s 
probability.  Conclusions  of  rules  contain  certainty  factors,  so  one  rule  might  be  said  to  provide 
stronger  or  weaker  evidence  than  another  rule  with  the  same  conclusion.  However,  parametric 
studies  indicate  that  MYCIN’s  results  are  fairly  insensitive  to  the  choice  of  certainty  factors,  so 
this  method  may  not  be  highly  useful. 

Another  enticing  tool  is  the  theory  of  fuzzy  sets,  introduced  by  Lotfi  Zadeh.^’^'^  Fuzzy  set 
theory  promises  to  address  many  types  of  uncertainty.  We  have  begun  working  with  the  Joint 
BRL /A  MSA  A  Working  Group  on  Fuzzy  Sets  to  apply  this  approach  to  our  work. 

B,  Higher-Level  Organization  of  Knowledge 

The  exclusive  use  of  rules  to  represent  domain  knowledge  makes  it  impossible  to  capture 
any  but  the  simplest  patterns  in  the  knowledge.  More  abstract  organization  can  only  be 
represented  by  more  complex  structures.  In  order  for  Genie  to  reason  more  powerfully, 
explain  its  behavior  at  a  suitably  high  level,  and  be  appropriate  as  a  teaching  tool,  it  must  have 
a  fair  level  of  abstraction.  This  consideration  is  central  to  the  eohancemeot  of  Genie’s  perfor¬ 
mance. 

The  first  two  steps  toward  higher-level  organization  were  the  representation  of  rules  and 


®  L,  A.  Zad«h.  “Fafiy  SeU.”  Information  Control^  vol.  8,  1065,  pp.  338-353. 

^  L.  A.  Z&deb.  **Outlioe  of  4  New  Approach  to  the  Aoalysis  of  Complex  Sptems  aod  Decisloo  Processes,” 
lEB  Traniotiiono  onSytUmt,  Man,  and  vol.  flmc-3,  oo.  1,  Jaouary  1073,  pp.  2^-AA. 

Of 

L.  A.  Zadeh.  Tht  Belt  «/  Futtg  Logie  i*»  lAe  Sfanagtmtnl  of  Uncertainty  in  Eipert  Sij$Um$.  Memortoduin  No. 
UCB/ERL  M83/<1,  Eleclrooic«  R«se»rch  Labontory,  Collrge  of  EngiooeriDg,  Univer«ity  of  Cilifoniia, 
Berkeley,  1083. 
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facto  as  frames,  and  the  use  of  concept  frames.  Concept  frames  are  currently  used  only  for 
numeric  values  in  arithmetic  facto.  The  next  step  is  probably  to  use  a  more  robust  concept 
frame,  like  that  found  in  Lenat's  program,  Thus,  in  a  future  incarnation  Genie  might 

build  concept  frames  for  every  ''object"  mentioned  in  the  rule  base,  so  it  could  fulfill  requests 
like  "Show  every  rule  that  mentions  annular  combustors". 

Of  course,  implementing  this  capability  would  require  that  Che  input  rules  be  analyzed 
using  a  much  more  powerful  grammar  than  the  one  with  which  Genie  is  currently  endowed. 
Genie  cannot  now  be  said  to  understand  its  knowledge  base  in  abstract  terms:  it  breaks  rules 
into  facto  and  very  effectively  handles  the  relations  among  them,  but  on  fact  statements  it  only 
performs  surface^Ievel  manipulations.  Clearly,  a  deeper  understanding  would  require  more 
intelligence  in  interpreting  the  knowledge, 

C.  Rule  Writer 

As  a  rule  base  grows,  keeping  it  free  of  conflicts,  contradictions,  and  overlaps  becomes 
extremely  difficult.  While  building  individual  rules  is  clear  and  straightr forward,  ensuring  that 
the  integration  of  hundreds  of  rules  produces  the  desired  results  can  be  a  problem.  It  would  be 
a  great  advantage  to  have  a  component  that  helped  the  knowledge  engineer  manage  the 
development  and  maintenance  of  the  knowledge  base,  facilitating  addition  and  debugging  of 
rules. 

Like  most  of  the  enhancements  discussed  here,  a  rule  writing  tool’s  utility  depends  on  its 
intelligence.  For  example,  even  the  ability  to  recognize  when  two  similar  facto  have  related 
meanings  is  difficult  to  automate.  A  first^pass  rule  writer  could  be  made  to  compile  rules  into 
the  knowledge  base  as  they  were  entered.  Such  a  program  would  be  able  to  determine,  just  as 
Genie  does  now,  which  rules  use  given  facts. 

D.  More  Natural  User  Interface 

Genie  ought  to  be  able  to  converse  with  the  user  in  something  closer  to  normal  English. 
This  is  tied  in  with  increasing  the  order  in  the  knowledge  base,  since  sophisticated  statements 
are  built  from  and  reflect  elaborate  knowledge  structures. 

Another  major  improvement  in  the  user  interface  will  be  possible  when  Genie  is  moved 
to  a  LISP  machine  in  the  near  future.  These  single-user  work  stations  provide  a  remarkably 
powerful  environment  for  both  development  and  production  runs.  A  combination  of  mouse, 
windows,  pop-up  menus,  and  high-resolution  graphics  will  make  communication  simple  and 
quite  fast. 


^  D.  Le&ii.  A\f:  An  nriificiat  inteltigenee  approach  lo  diaeovery  m  mathtmaiici  a$  hturittic  teareh.  SAIL  AIM^ 
286,  SUeford  Artificial  iDklligence  Laboratory,  1076. 

10  * 

D.  A.  WaterroaD  and  F,  Hayes-Rotk,  cds.  PatUrrhDkecttd  Infertile  Sy$ttm$.  Academic  Press,  loc..  1078. 
pp.  30>33. 


88 


ANALYSIS  OF  GRADIENT  CHANGE  THRESHOLDS  IN  THE 
DETECTION  OF  EDGES  OF  OBJECTS  FROM  RANGE  DATA 

C.  N*  Shen  and  R.  L.  Racicot 
U.S.  Army  Armament,  Munitions,  and  Chemical  Command 
Armament  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
WatervUet,  NY  12189-5000 


ABSTRACT,  The  detection  of  the  gradient  change  for  an  object  or  an 
obstacle  can  be  analyzed-  The  ratio  of  the  magnitude  of  the  gradient  change 
to  measurement  noise  is  related  to  the  miss  and  false  alarm  of  the  system- 
The  spacing  of  the  measurements  also  affects  the  detection  threshold  of  this 
gradient  change, 

I ,  INTRODUCTION -  A  laser  range  finder  is  installed  on  the  top  of  a  mast 
attached  to  a  land  vehicle  or  a  helicopter.  This  laser  range  finder  measures 
the  distance  between  itself  and  the  ground  points  on  a  terrain  with  obstacles* 
For  segmentation,  the  edges  of  the  boulder  and  crater  must  be  estimated.  In 
determining  the  near  edges  of  a  boulder  or  the  far  edges  of  a  crater,  it  is 
necessary  to  locate  the  first  differences  of  the  slopes  of  the  terrain-  This 
is  equivalent  to  finding  the  second  differences  of  the  terrain  range  points. 
The  threshold  values  of  the  second  differences  can  be  in  terms  of  an  angle  in 
a  vertical  plane.  The  probabilities  of  miss  and  false  alarm  can  be 
ascertained  depending  on  the  method  of  estimation  and  the  scanning  scheme. 

II.  THE  LAPLACIAN  METHOD,  The  Laplacian  Method  considers  cross-sections 
of  terrain  and  looks  for  changes  in  slope  in  the  azimuth  or  radial  dlrection- 
Since  we  are  only  considering  one-dimensional  problems,  we  can  write  for  the 
measurement  equation 


=  di  +  Vi  (1) 

The  change  in  slope  is  estimated  by  computing  the  following  sufficient 
statistic; 

L 

Si  =  =1+1  -  2zi  +  zi_i  (2) 

This  is  a  digital  approximation  of  the  second  derivative,  or  second 
difference,  of  the  range. 

Due  to  the  presence  of  measurement  noise  In  the  calculation  of  s^,  it  is 
impossible  to  discern  with  absolute  certainty  whether  a  change  of  slope 
exists.  The  Neyman-Pearson  criterion  provides  a  decision  rule  which  we  may 
use  to  accurately  detect  edges  of  obstacles  with  a  known  probability  of  making 
an  error.  To  produce  the  decision  rule,  first  we  must  compute  the  variance  of 
as  follows.  We  assume  that  the  noise  components  Vj^  of  the  measurements  are 
Independent  Gaussian  random  variables  with  zero  mean  and  a  variance  of  o^. 
Then,  since  z±,  and  z±^i  are  independent,  Eqs.  (1)  and  (2)  yield 
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var(sj_)  =  var(zi+]^-2zi+Zj;-,.l) 


=  var(z^^^)  +  var(-2zj_)  +  var(zj^+3^) 

=  +  4a2  -h  =  6a^  (3) 

Because  is  a  scalar  random  variable,  the  Neyman-Pearsoti  criterion 
provides  a  decision  rule  identical  to  that  derived  using  hypothesis  testing. 
The  desired  decision  rule  is 

no  signal  if  -T  s^  <  T 

DECISIONi  =  (4) 

presence  of  signal  if  is  otherwise 

where  T  is  the  threshold  in  the  decision  process  which  can  be  determined  from 
the  equations 


Pp  =  2(|)[-T/v^]  (5) 

Pm  =  'l>[(T-u*)//^]  (6) 

where 

z 

(|)(z)  =  I/ZTit  /  dot  (7) 

—  CD 

It  is  noted  that  the  magnitude  of  Sj^  is  not  estimated  by  the  Laplacian  Method. 
The  quantity  u*  is  called  the  "minimum  detectable  change  of  slope,"  since  it 
is  the  smallest  change  of  slope  that  can  be  detected  with  a  miss  probability 
of  or  lower.  The  quantity  Pp  indicates  the  probability  of  a  false  alarm. 
The  miss  probability  Pj^  is  the  probability  of  not  detecting  a  true  change  of 
slope  equal  to  u* . 


Typically,  the  standard  deviation  Zr  is  a  known  system  parameter  and  u* 
is  chosen  so  that  suitable  values  of  and  P^^  can  be  obtained.  Table  1  shows 
the  trade-off  of  P^  vs.  P^  for  different  values  of  the  ratio  u*/Zr.  These 
values  are  computed  using  Eqs.  (5)  and  (6). 

TABLE  1.  FALSE  ALARM  PROBABILITIES  AND  MISS  PROBABILITIES 
FOR  VARIOUS  T/y^  AND  u*//r 


1  r 

I  I 

T 

1 

Pm  For 

Pm  For 

Pm  For 

Pm  For 

1 

1 

Pm  For 

T 

1 

I  T//R  I 

_ 

1 

1 

u*/V^2 

u*/  Zr=4 

u*//r^ 

1 

1 

u*/  Zr-6 

1 

1 

r  I 

I  1.000  1 

.3174 

T 

1 

.1587 

.0227 

1 

.0013 

.0000 

~r 

1 

.0000 

T 

1 

1  1.500  1 

.1336 

1 

.3085 

.0668 

.0062 

.0002 

.0000 

1 

1  1.679  1 

.0932 

1 

.3741 

.0932 

.0102 

.0005 

1 

.0000 

1 

1  2.000  1 

.0456 

1 

.5000 

.1587 

.0227 

.0013 

1 

.0000 

1 

1  2.146  1 

.0319 

1 

.5580 

.1966 

.0319 

.0022 

1 

.0001 

1 

1  2.500  1 

.0124 

1 

.6915 

.3085 

.0668 

.0062 

1 

.0002 

1 

1  3.000  1 

.0026 

1 

.8413 

.5000 

.1587 

.0227 

1 

.0013 

1 

1  3.500  I 

.0003 

1 

.9332 

.6915 

.3085 

.0668 

1 

.0062 

1 

1  4.000  1 

.0000 

1 

.9772 

.8413 

.5000 

.1587 

1 

.0227 

1 

_ _  _  _ 

a. 

_ 1 

_  .._J _ 

I _ 

_L 

1 
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III.  THE  RAPID  ESTIMATION  SCHEME  AMD  COMPARISON  OF  ERROR  PROBABILITIES. 
The  variance  of  for  the  Laplaclan  Method  is  usually  too  high  when  a  change 
of  slope  occurs  during  scanning.  An  adaptive  method  called  Rapid  Estimation 
Scheme  is  used  instead  to  keep  the  variance  lower,  thus  also  reducing  the 
probabilities  of  false  alarm  and  miss.  A  discussion  of  the  Second  Residual 
Method  for  the  Rapid  Estimation  Scheme  is  given  in  the  Appendix. 

The  different  expressions  for  the  error  probabilities  of  the  two  methods 

are  given  in  Eqs.  (5)  and  (6)  and  reiterated  in  Table  2j _ We  see  that  the 

expressions  for  both  methods  are  the  same  except  that  6a^  appears  with  the 
Laplaclan  Method,  whereas  appears  with  the  Second  Residual  Method. 

TABLE  2.  ERROR  PROBABILITIES 


T - 

1  Method 

1 

T" 

I 

I 

Probability  of  False  Alarm 

1 

1  Probability  of  Miss 

1 

T 

1 

1 

1 

I  Laplacian 

I 

T" 

I 

I 

pL  =  2(|)[-T/V^] 

P 

1  =  <t>[(T-u*)//6^] 

1  ^ 

T 

1 

1 

I 

I  Second  Residual 

I 

I 

pS  =  2(|)[-T//^] 

1  pS  =  4,[(T-u*)//^I^] 

1 

1 

J _ 

,L=. 

F 

l_M _ 

1 

We  know  that  the  Second  Residual  variance  is  less  than  or  equal  to  the 
Laplaclan  variance: 


(8) 

Since  <j)(z)  is  a  monotonically  increasing  function  and  has  a  negative 
argument  in  all  the  expressions  in  Table  2  (we  assume  T-*u*  <  0) ,  we  can 
conclude  that 

(Second  Residual)  <  (Laplaclan)  (9) 

F  F 

P^  (Second  Residual)  <  P^  (Laplacian)  (10) 

M  M 

Figure  1  illustrates  how  the  smaller  Second  Residual  variance  leads  to  smaller 
error  probabilities. 

With  smaller  error  probabilities,  we  expect  the  Second  Residual  Method 
will  have  a  better  performance  over  the  Laplaclan  Hethod- 

IV.  MINIMUM  SLOPES  OF  DETECTABLE  OBSTAaES.  In  this  section  we  will 
determine  the  threshold  in  edging  of  gradient  changes  of  obstacles.  These  Toay 
be  detected  using  the  Second  Residual  or  the  Laplacian  Method. 

Obstacles  may  or  may  not  be  detectable  depending  on  the  change  of  slopes 
of  the  terrain  at  its  edges.  It  will  be  shown  here  that  the  minimum 
detectable  slope  change  depends  upon  a  given  group  of  parameters. 

Figure  2  shows  a  diagram  of  an  obstacle-  C  and  B  are  consecutive  points 
where  laser  beams  emanating  from  laser  range  finder  bounce  off  the  terraln- 
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The  slope  of  the  obstacle  is  tan  0,  since  the  slope  between  C  and  B  is  zero. 
The  mast  height  is  b,  the  height  at  which  the  laser  range  finder  is  located. 
Point  D  is  the  estimate  of  the  range  based  on  the  data  at  point  B  and  previous 
points.  JThus,  the  rang^e  of  point  C  is  measured  as  Zi-(-2  and  point  D  lies  at  a 
range  of  d^^2  >  where  dj^^2  ^^^l+l^*i+l  prediction  of  the  range  dj^-|-2 

from  previous  data.  We  see  that  the  distance  from  C  to  D  is  the  residue, 

^i+2‘  apparent  that  the  expected  value  of  rj^^2  takes  its  minimum  value, 

u* .  From  the  geometry,  the  quantity  tan  0  can  be  computed  by  finding  x/y. 
These  quantities  are  found  as  follows: 

X  -  u*  sin  S  (11) 


y  =  Ap  -  cos  3 


(12) 


then 


tan  0  ^  x/y  =  (u*  sin  S)/(Ap  -  u*  cos  0) 


(13) 


By  extending  the  above  idea,  we  have  two  slopes,  AB  and  BC,  instead  of 
one  in  the  previous  cases.  The  angle  y  is  the  difference  of  the  angle  0  for 
slope  BA  and  the  angle  ^  for  slope  CB,  as  shown  in  Figure  3-  Now  the  slope  of 
Y  becomes 


tan  Y  =  -  ,  (Y  =  (14) 

b 

where 

a  -  u*  sin  (frf^))  (15) 


and 


b  =  (Ap)  cos  <{>  -  (Ap)  sin  (}>  cot  ( 0+<l>)  -  u*  cos  (^b^*) 


=  (Ap) 


sin  cos  <|)  -  cos  ( 3+(|>)  sin  ij) 

sin 


U*  cos  ( 


(Ap)  sin  g 

- - - - -  u*  cos  (&••<{>) 

sin  (3+<|») 

=  (Ap)  sin  e  sin  (gf(|))  [ - - - - - - - - - ] 

sin^  0+<)>)  (Ap)  sin  g  tan  (gfcj)) 

Combining  the  terras,  we  have 

(u*/t)  tan^  (^^"il*) 

1  +  tan^  (0+<j>)  -  (u*/ t)  tan  (gh*}*) 

where 


(16) 


(17) 


T  =  (Ap)  sin  g 


(18) 
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The  quantity  x  is  the  projection  of  the  data  spacing  in  the  direction  of  sin 

e. 

If  Eq.  (17)  Is  solved  for  u*/t,  we  have 

A  tan  Y  [1  +  tan^ 

u*/t  =  K(Y,3+(j))  - - - - -  (19) 

tan  (&^-<|>)  [tan  (&f<J))  +  tan  y] 

Table  3  gives  the  values  of  K  as  a  function  of  angle  of  detection  Y,  the 
sums  of  the  elevation  angle  3,  and  one  of  angle  4*  for  the  terrain. 

The  above  equation  can  also  be  derived  from  Figure  4  which  gives 

A 

u*/t  =  K(Y,fr»-<|))  =  [cot  (&f<|.)  -  cot  (&»-<{d-Y)]  (20) 

V.  FUNCTION  K  OF  ANGULAR  GEOMETRY.  If  we  can  Iceep  the  ratio  u*/t 
constant  in  Eq.  (19)  or  (20),  then  the  value  of  K(i^&f(|))  will  be  constant  in 
Table  3.  This  can  be  achieved  by  letting  both  m*//r  and  x/ /r  be  constant. 
First,  we  will  discuss  the  value  of  u*//r. 

Table  1  shows  the  values  of  Pp  for  given  threshold  to  noise  ratio  T/»^ 
which  guarantees  the  probability  of  detection.  It  also  lists  the  values  of  Pj^ 
for  both  T//r  and  the  signal  to  noise  ratio,  u*/*^.  For  example,  let  us  take 
T//R  =  1.679  and  u*//R  “3.  We  have 

Pp  =  0.0932  (21a) 

Pm  =  0.0932  (21b) 

which  are  reasonable  values  for  our  problem. 

The  second  part  is  to  keep  the  ratio  x/i^  constant.  This  is  related 
to  the  scanning  scheme  given  in  the  next  section.  From  Eq.  (18),  we  have 

x//r  =  (Ap)(sin  3)/*^  =  constant  =  L  (22) 

Then  the  ratios 

u*/v^  3 

- ---K(Y,&f4>)  (23) 

x/i^  L 

If  L  is  chosen  as  4,  then  one  will  look  at  the  points  for  K  =  0.75  in  Table 
3.  If  L  is  chosen  as  1.6,  then  one  will  look  at  the  points  for  K  “  1.875  in 
Table  3.  For  K  =  0.75,  the  following  set  of  angles  appears 

3+(|)  15“  20“  25“  30“ 

Y  3.8“  6.7“  10.2“  15.8“ 

In  summary,  the  above  value  of  Y  is  guaranteed  to  be  detected  for  the 
conditions 
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TABLE  3.  VALUES  OF  K(t,  AS  FUNCTIONS  OF  GAMMA  AND  BETA  +  PHI 


T 

1 

1 

i 

Gamma 

1 

1 

1  5 

1 

T 

1 

1 

1 

10 

T 

1 

1 

1 

15 

T 

1 

1 

1 

20 

T 

1 

1 

1 

,  1 

1 

25  1 

1 

Beta  +  Phi 
30 

1 - 

I 

I  35 

T 

1 

1 

1 

40 

— 

45 

T 

1 

1 

1 

50 

T 

1 

1 

1 

T 

1 

0 

1 

1  0.0 

T 

i 

0.0 

T 

i 

0.0 

T 

1 

0.0 

T 

1 

1 

0.0  1 

0.0 

I 

I  0.0 

T 

1 

0.0 

0.0 

T 

1 

0.0 

T 

1 

! 

2 

1  3.2857 

1 

0.9667 

i 

0.4612 

1 

0.2724 

1 

0.1819  1 

0.1317 

I  0.1011 

1 

0.0811 

0.0675 

1 

0.0578 

1 

1 

4 

1  5.1163 

1 

i:6605 

i 

0.8278 

1 

0.5014 

1 

0.3405  1 

0.2495 

1  0.1933 

1 

0.1562 

0.1307 

1 

0.1126 

1 

1 

6 

1  6.2855 

1 

2.1839 

1 

1.1270 

1 

0.6972 

1 

0.4802  1 

0.3557 

i  0.2778 

I 

0.2261 

0.1902 

1 

0.1646 

1 

1 

8 

1  7.0986 

1 

2.5936 

1 

1.3762 

1 

0.8668 

1 

0.6046  1 

0.4521 

1  0.3558 

i 

0.2913 

0.2464 

1 

0.2142 

1 

! 

10 

1  7.6980 

1 

2.9238 

i 

1.5875 

1 

1.0154 

1 

0.7164  1 

0.5403 

1  0.4281 

1 

0.3527 

0.2998 

i 

0.2617 

1 

1 

12 

1  8.1592 

1 

3.1962 

1 

1.7694 

i 

1.1471 

1 

0.8175  1 

0.6214 

1  0.4956 

1 

0.4105 

0.3506 

1 

0.3074 

1 

1 

14 

1  8.5258 

1 

3.4252 

1 

1.9280 

i 

1.2649 

i 

0.9096  1 

0.6965 

1  0.5589 

1 

0.4652 

0.3991 

1 

0.3514 

1 

1 

16 

1  8.8250 

1 

3.6210 

1 

2.0678 

! 

1.3711 

1 

0.9941  1 

0.7664 

1  0.6184 

1 

0.5172 

0.4457 

1 

0.3939 

1 

i 

18 

i  9.0742 

1 

3.7906 

2.1922 

1 

1.4675 

i 

1.0721  1 

0.8316 

1  0.6746 

1 

0.5669 

0.4905 

1 

0.4351 

1 

1 

20 

!  9.2856 

1 

3.9392 

1 

2.3039 

1 

1.5557 

1 

1.1445  1 

0.8930 

1  0.7279 

0.6144 

0.5337 

1 

0.4751 

1 

1 

22 

t  9.4674 

1 

4.0709 

I 

2.4050 

1 

1.6369 

1 

1.2120  1 

0.9508 

1  0.7787 

1 

0.6600 

0.5755 

1 

0.5142 

t 

1 

24 

1  9.6260 

1 

4.1887 

i 

2.4972 

1 

1.7119 

1 

1.2752  1 

1.0055 

1  0.8273 

1 

0.7040 

0.6161 

0.5524 

1 

! 

26 

1  9.7658 

1 

4.2949 

1 

2.5817 

1 

1.7818 

1 

1.3347  i 

1.0575 

1  0.8738 

1 

0.7465 

0.6557 

1 

0.5898 

1 
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I 
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1.9084 
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1.1547 

1  0.9618 
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4.5607 
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2.7995 
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1.9662 
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1.2003 

1  1.0037 
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0.8668 

0.7691 
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0.6986 
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1 
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4.6357 
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1 

2.0209 
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1.2443 

i  1.0443 

i 

0.9050 

0.8056 

1 

0.7340 

! 

i 
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2.9223 

1 

2.0730 

1 

1.5902  i 

1.2868 

1  1.0838 

1 

0.9424  1 

0.8416 

1 

0.7692 
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1 

38 

i  10.3577 

i 

4.7709 

1 

2.9785 

1 

2.1226 
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1.6350  1 

1.3280 
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0.9792  1 

0.8772 

1 

0.8042 
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1 
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4.8322 

1 

3.0318 

1 

2.1701 
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1  1.1602 
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0.8391 
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1 
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i  10.9638 
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! 

J_ 

5.3073 
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1 

3.4641 

1 

1 

2.5711 

1 

J_ 

2.0570  1 

1 

1.7321 

i  1.5156 

1 

1 

1. 

1.3681  1 
_ 1 

1.2679 

1 

L 

1.2031 

1 

1 

u*/i^  “3  ,  Pp  —  P(j  “  0*0932  ,  t/ /r  =  4 


and  the  variable  3+4*  as  listed. 


VI.  THE  SCANNING  SCHEME.  In  order  for  the  value  of  L  to  be  constant  in 
Eq.  (22),  one  will  take  the  discrete  form  in  Figure  5  as 

CAp)l  sin  $1  =  (Ap)2  sin  32  “  ^  “  constant  (23) 

If  b  is  the  height  of  the  mast,  then 


sin  3i  = 


b 


sin  32 


-  b 
*'b^P2^ 


Thus,  we  have  in  Figure  4 

(Ap)i  (Ap)2 

/b^fp^  /b^P2^ 


b 


(24) 


(25) 


The  above  equation  indicates  that  the  spacing  of  the  horizontal 
projection  is  proportional  to  the  radial  distances  from  the  laser  to  points  on 
the  horizontal  plane.  For  example,  let  us  take  b  =  2,  then 

(Ap)i  (Ap)2 

/4+P2^2  1(4+ P2^  ^ 


or 

(Ap)i  -  (/4+^)/rL/2  (26) 

For  example,  given  /r  =*  0.125  and  L  =  1.6  or  /r  =  0.05  and  L  -  4.0,  in  both 
cases  we  have  t  -  =  0-20.  Then,  from  Eq*  (26),  we  have 

p  2m  5ra  10  m  20  m 

Ap  0.2828  m  0.5353  m  1.1832  ra  2.010  m 


In  this  case  we  may  miss  a  boulder  of  0.2  m  at  2  m  away  or  a  boulder  of  2  m  at 
20  m  away. 

VII.  PROBABILITY  OF  DETECTIOH  FOR  RANDOMLY  LOCATED  EDGES.  In  the 
previous  sections  it  was  assumecl  that  range  measuremi^at^^  were  available  from 
the  range  finder  to  the  exact  vertex  of  an  edge.  This  is  represented  by  point 
3  in  Figures  2  and  3-  An  edge  is  defined  by  a  discrete  angular  change  y. 

By  assuming  a  range  measurement  point  to  fall  on  the  edge  vertex,  results  in  a 
single  value  for  the  residual  u*  in  Eq-  (19).  This  corresponded  to  a  single 
value  for  the  Pq  probability  of  detection,  in  which  Pq  -  1-Pm  with  Pj|  being 
given  by  Eq.  (6). 
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Tn  practice,  the  edges  of  objects  might  be  randomly  distributed  and  the 
range  measurements.  In  general,  might  not  fall  on  an  edge  vertex.  A  more 
reallsltc  approach,  therefore,  might  be  to  treat  the  range  measurement^  t^^  be 
randOTily  distributed  near  an  edge  represented,  for  example,  by  points  A,  B, 
and  C  In  Figure  6.  A  random  residual  u*  would  result  for  given  edge  angular 
change  T  Instead  of  the  constant  value  of  u*  assumed  previously. 


The  probability  of  detection  in  ^hls  case  can  be  calculated  using  Eq.  (6) 
as  a  function  of  the  random  variable  u*;  _ 

_  _  T-u* 

P(Detection| u*)  =  1  -  P  (Misslu*)  =  1  -  $[-- — ]  (27) 


Form  geometric  considerations  it  can  be  shown  that  u*  will  range  approximately 
from  u*/2  to  u*  in  Figure  6.  It  can  be  further  assumed  that  the  distribution 
of  will  be  uniform  over  this  range  of  values  which  corresponds  to  a  purely 
random  distribution  of  object  edges  on  a  given  terrain.  Equation  (27)  can 
then  be  used  to  determine  the  total  probability  of  detection: 


u*  _  _  _ 

P(Detectlon)  =  /  P(Det| u*)f (u*)du* 
u*/2 


.u" 


u*  ‘^u*/2 


P(Det| u*)du* 


T-u* 

(l-$[ - ])du* 

/k 


(28) 


in  which  f(u*)  equals  uniform  distribution  on  (u*/2  to  u*),  and  u*  is  given  by 
Eq.  (19). 

Equation  (28)  can  be  solved  as  a  function  of  T,  and  u*  where  u*  is  a 

function  of  Y,  ( &+<t>) ,  and  x  as  in  Eq.  (19).  As  an  example,  let 

T//r  =  threshold  level  for  detection 

=  2.146;  gives  Pp  =  0.0319  from  Table  I 

/r  =  where  o  =  range  data  noise  level 

=  0.1225  for  0  =•  0.05 

T  »  0.20 

The  resulting  P(Detectlon)  is  shown  in  Table  4  as  a  function  of  the  angle 
change  Y  and  (3+4')*  Other  similar  information  can  readily  be  generated  for 
other  parameter  values  depending  on  the  actual  problem  to  be  solved. 


The  behavior  of  the  minimum  detectable  angular  change  Y  can  be  readily 
determined  from  the  type  of  information  given  in  Table  4.  For  example,  by 
requiring  P(Detection)  =  0.90  and  letting  =  15  degrees,  results  in  a 
minimum  detectable  value  of  Y  to  be  about  40  degrees.  This  compares  to  a 
value  of  Y  ut  13  degrees  for  the  nonrandom  case  previously  considered  where 
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TABLE  4.  TOTAL  PROBABILITY  OF  DETECTION  FOR  RANDOMLY  LOCATED  EDGES 


T 

i 

1 

1 

Gamma 

i  2 

T 

1 

1 

1 

4 

1 

1 

1  6 

1 

T 

1 

1 

1 

8 

T 

1 

1 

1 

10 

T 

1 

1 

1 

Beta  +  Phi 
12 

1 - 

I 

I  14 

1 

T 
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1 

1 

16 

18 

T 

1 

1 

1 

20 

T 

1 

1 

1 

T 

1 

0 

1  0.0 

T 

1 

0.0 

1 

1  0.0 

1 

1 

0.0 

T 

1 

0.0 

T 

1 

0.0 

1 

1  0.0 

1 

1 

0.0 

0.0 

T 

1 

0.0 

T 

1 

1 

2 

1  1.0000 
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1  0.7526 

1 

0.3606 

1 

0.1741 

1 

0.1004 

I  0.0676 

1 

0.0508 

0.0412 

! 

0.0352 

I 

i 

4 

1  1.0000 

1.0000 

1  0.9730 

1 

0.7563 

1 

0.4583 

1 

0.2645 

i  0.1636 

1 

0.1110 

0.0816 

1 

0.0641 

1 

1 

6 

!  1.0000 

1 

1.0000 

t  0.9962 

1 

0.9077 

1 

0.6794 

1 

0.4431 

1  0.2831 

t 

0.1891 

0.1343 

1 

0.1011 

1 

1 

8 

1  1.0000 

1 
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1  0.9993 
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0.9605 

i 

0.8079 
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0.5895 

1  0.4016 
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0.2742 
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1 
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1  0.9998 
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1 

0.6960 
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1 
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1 

1 
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1 
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1 
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1 
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1 
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1 

1 

1 
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1 
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1 
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1 

1 
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! 

,L 

0.6743 

! 

_L 

the  range  Is  assumed  to  be  measured  directly  to  the  edge  vertex. 


Vlll.  CONCLUSION.  For  an  assigned  probability  of  false  alarm  and  miss, 
one  can  determine  the  signal  to  noise  ratio  and  the  threshold  to  noise  ratio. 
If  we  use  a  special  scanning  scheme  such  that  the  spacing  of  horizontal 
projections  is  proportional  to  the  radial  distances  from  the  laser  to  a 
horizontal  plane,  then  the  function  K  of  angular  geometry  is  also  constant. 

The  detectible  angle  Y  with  certain  probability  can  be  found  if  K  and  &l<|>  are 
known.  The  angles  3+(|»  are  related  to  the  elevation  angle  3  and  the  terrain 
slope  4). 

The  following  results  were  discussed  in  this  paper; 

1.  Ability  to  detect  the  change  of  slopes  in  a  terrain  for  navigation  of 
vehicles  or  robotic  platforms. 

2.  Determination  of  the  probability  of  false  alarm  and  the  probability 
of  detection  for  various  signal  to  noise  ratios  and  for  the  case  of  randomly 
distributed  measurements. 

3.  Determination  of  required  signal  to  scanning  factor  ratios  for 
various  slopes  and  slope  changes.  The  signals  relate  to  the  probability  of 
detection  and  the  scanning  factor  influences  the  size  of  the  obstacles. 

4.  Computation  of  the  probability  for  detection  of  an  edge  if  the 
reference  directions  of  the  laser  rays  are  uniformly  distributed  over  the 
slopes  near  an  edge  point. 


98 


APPENDIX 


THE  SECOND  RESIDUAL  METHOD 

The  Second  Residual  Method,  similar  to  the  case  case  with  the  Laplacian 
Method,  considers  cross-sections  of  terrain  and  looks  for  changes  in  slope. 
Here,  however,  a  state  estimation  and  decision  process  is  used  to  perform  the 
detection.  A  discrete  second  order  linear  time-varying  system  model  is  used 
to  estimate  ranges  and  gradients  (slopes)  from  current  and  previous  data.  If 
the  difference  between  a  range  measurement  and  range  prediction  is  large 
enough,  a  change  in  slope  is  indicated  and  a  special  estimation  scheme  is 
employed. 

THE  SYSTEM  MODEL.  A  stabilized  system  model  for  a  terrain  has  been 
proposed.  The  state  vector  is 


Xi  =  [di.  gll'f  (Al) 

where  indicates  the  i-th  range  and  gj  the  i-th  gradient  (or  slope) .  A 
change  of  slope  is  modelled  by  the  presence  of  an  unknown  Input,  uj^,  which 
adds  to  the  gradient  component  of  through  the  input  matrix.  The  system 
model  is 


Xi+1  =  FiXi  +  Bui 


(A2) 


where 


1 

1 

—  — 1 

1  bi 

1-c  1 

1  0  1 

1 

and 

B  =  1  1 

1  0 

^i  1 

1  1  1 
1  1 

where  and  q-j^  are  the  time  variant  parameters  and  c  is  a  small  non-negative 
real  number  necessary  for  system  stability  while  scanning  inward  from 
skylines.  The  measurement  equation  is 

z  1-1-1  =  Hxj^+1  +  Vi^-i  (A3) 

where  H  -  [1,0]  and  is  zero  mean  Gaussian  noise  where 

0  for  all  i  ^  j 

E{vlvj}  =  (A4) 

for  all  i  -  j 

Our  problem,  then,  is  to  detect  any  small  nonzero  inputs  u^,  since 
they  represent  a  change  in  slope - 

As  in  the  Laplacian  Method,  a  sufficient  statistic  is  necessary  to  detect 
the  change  of  slopes.  This  sufficient  statistic  is  called  the  residue, 
as 
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^i+2  =  ^i+2  ~ 


(A5) 


where  Is  optimal  estimate  of  given  z±+i,  It  can  be 

shown  that 

Etri+l)  = 

=  (l-c)Ui  S  Ui  (A6) 
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Figure  5,  Scanning  Factors 
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Abstract.  Present  computer  simulations  use  an  acquisition  model  that  employs 
the  Johnston  criteria  for  identification.  This  model  also  provides  criteria 
for  different  states  of  acquisition  such  as  detection,  recognition,  and  iden¬ 
tification.  However,  experimental  results  obtained  during  tests  have  shown 
that  it  is  not  always  necessary  to  go  through  all  the  stages  of  acquisition  in 
sequence  to  achieve  identification.  Additionally,  some  unique  system  charac¬ 
teristics  other  than  size  in  relation  to  contrast  can  be  used  to  acquire  objects 
even  when  these  objects  are  partially  obscured.  Other  experimentally  observed 
obscuration  factors  are  also  discussed  which  are  related  to  motion  and  distance 
for  which  different  stimuli  are  required  for  identification.  These  experimental 
results  are  discussed  and  a  mathematical  model  encompassing  these  observations 
is  presented. 

1.  Introduction.  This  paper  is  concerned  with  the  image  perceived  by  a  viewer 
using  a  thermal  sight  (IR  sensor  and  imaging  device).  Inasmuch  as  thermal 
sights  are  being  used  by  many  of  the  Army's  advanced  weapon  systems,  including 
tanks  and  fighting  vehicles,  as  one  of  their  principle  means  of  target  acqui¬ 
sition,  it  becomes  important  to  investigate  the  human/sight  interface  in  the 
acquisition  process.  This  paper  will  concentrate  on  four  factors--atmospheric 
conditions,  thermal  image,  obscurants,  and  di stance--which  have  been  shown 
experimentally  to  influence  the  image  perceived  by  a  human  through  a  thermal 
sight.  A  mathematical  model  of  the  influence  of  these  factors  on  the  thermal 
sight  and  human  interface  will  be  presented  for  possible  use  in  modeling  this 
relationship.  A  comparison  of  this  model  with  present  techniques  will  be  dis¬ 
cussed. 

2.  Background.  Presently  equations  such  as  the  Beers-Lambert  Law  or  the 
Bougher-Lambert  Law  have  been  modified  to  account  for  thermal  signature  attenu¬ 
ation  as  a  function  of  temperature  contrast  between  the  object  and  its  back¬ 
ground,  This  formula  is  given  in  equation  1  below 

T.Toe-*”'  (1) 


where 


T  =  attenuated  temperature  in  °C 
Tq  =  object  intrinsic  temperature  contrast  in  °C 
R  =  Range  between  object  and  sensor  (km) 
d  =  Extinction  coefficient  (Neper  per  km) 
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This  equation  offers  a  means  of  determining  thermal  contrasts  as  a  continuous 
function  of  R  until  some  response  threshold  Aj  is  reached  beyond  which  the  sensor 
does  not  function.  Acquisition  then  occurs  when  A  >  At  where  A  is  given  by 
equation  (2). 


T  -  Tb 
Tb 


T  =  Thermal  contrast  from  formula  1. 


(2) 


Tg  =  Thermal  background 

Conversely,  when  A<At,  then  acquisition  does  not  occur. 

3.  Discussion.  In  reality  although  differences  in  temperature  between  an  ob¬ 
ject  and  its  background  affect  the  target  acquisition  process,  there  is  more  to 
acquisition  than  thermal  contrasts.  For  example,  there  is  motion  as  well  as  the 
uniqueness  of  thermal  signatures  in  relationship  to  their  position  relative  to 
the  object.  To  illustrate  this  point,  suppose  an  object  has  a  temperature  Tq 
and  this  temperature  relative  to  its  background  is  greater  than  some  threshold 
t,  then  this  object  is  discernible  by  a  thermal  sensor  within  the  range  of  the 
sensor.  However,  depending  on  the  extent  of  the  background  object  contrast 
in  relationship  to  the  size  of  the  object,  the  degree  of  the  acquisition  is 
determined;  i.e.,  detection  through  identification  friend  or  foe. 

This  method  which  is  currently  used  in  modeling  does  not  take  into  account  the 
uniqueness  of  certain  aspects  of  the  thermal  signature  which  are  neither  re¬ 
lated  to  the  temperature  of  the  object  nor  its  size.  For  example,  an  exhaust 
plume  from  a  vehicle  which  exhibits  directionality  can  in  some  instances  lead 
to  the  identification  of  an  object  where  the  object  background  contrast  rela¬ 
tionship  would  indicate  that  the  object  should  be  at  the  detection  phase  of 
the  acquisition  process.  Since  the  exhaust  is  separate  from  the  object,  unless 
it  is  considered  in  the  acquisition  process,  the  relationship  between  distance 
and  thermal  contrasts  can  be  misunderstood. 


4,  Model . 

a.  General .  In  order  to  include  such  items  as  exhaust  plumes  and  other  hot 
areas  in  thermal  signatures,  the  following  model  was  developed.  The  three  di¬ 
mensions  of  the  model  are  atmosphere,  image,  and  distance.  Obscurants  are  an 
additional  dimension  which  enters  into  the  equation  as  a  temporary  condition. 

The  model  considers  atmosphere  and  image  first,  then  atmosphere,  obscurants  and 
image.  It  then  considers  atmosphere,  distance  and  image,  and  finally  atmosphere, 
distance,  obscurants  and  image, 

b.  Atmosphere  and  Image.  Let  C  be  a  set  of  characteristic  curves  on  some 
variable  t  with  G  a  set  of  functions  on  C.  For  some  a  we  have  n^,  Ig,  Cg  c  C 
and  Oa  c  G  such  that 
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What  this  model  explains  is  the  manner  in  which  the  thenmal  image  is  displayed 
on  the  screen  of  a  thermal  sight.  The  set  C  is  the  set  of  mappings  of  the 
lines  on  the  screen.  The  image  of  these  mappings  is  mapped  into  subportions 
of  the  screen  by  the  set  of  mappings  G.  For  a  given  atmosphere  and  object 
there  is  an  associated  image  I^.  When  the  union  of  the  images  associated  with 
the  subportions  of  the  screen  are  equal  to  Ig,  the  object  is  then  identified. 
The  question  of  what  happens  when  some  but  not  all  of  the  subportions  are  miss¬ 
ing  is  addressed  in  the  next  section. 

c.  Atmosphere.  Image  and  Obscurants.  Obscurants  such  as  chemical  smoke  or 
dust  cause  a  subset  of  the  subportions  of  the  image  Ig  to  be  partially  obscured 
The  question  then  arises  as  to  how  well  the  human  can  complete  the  gestalt  with 
missing  pieces.  This  capability  happens  to  be  a  human  trait. 

Let  p  represent  obscurant  affect  with  IpC  I.  Given  an  a,  we  have  for  Ipg  9  la 

*^ai  •  ^*^^pai 
9ai  •  ^  pai  “^^pai  • 

In  this  model  it  is  assumed,  just  as  it  has  been  demonstrated  in  numerous  psy¬ 
chological  experiments  that  humans  can  complete  the  gestalt.  It  is  also  as¬ 
sumed  that  the  human  is  not  just  reading  into  the  gestalt  in  relation  to  his 
imagination  as  would  be  the  case  in  an  ink  blot  or  Roscharch  test,  but  that  a 
true  image  will  be  discernible  from  the  partial  image  Ipg.  Thus,  although  it 
is  expected  that  one  would  require  all  the  parts,  it  is^possible  for  a  human  to 
estimate  all  the  parts  from  some  of  the  parts.  Thus,  identification  need  not 
include  all  the  parts  in  order  to  occur. 

d.  Atmosphere,  Distance  and  Image.  Clearly  C,  the  set  of  characteristic 
curves,  will  vary  with  distance  d.  Thus,  for  a  given  distance  we  have 

^adi  •  ^  adi  9adi  adi”^^adi  9  ^ad 

Again,  if  U  Ig^j^  =  Ig^j,  identification  has  occurred. 
i=l 

In  this  model  the  image  has  been  adjusted  for  distance.  This  is  the  case 
with  objects  in  relation  to  their  background.  For  example,  telephone  poles  at 
a  distance  look  smaller  than  telephone  poles  that  are  close  up,  but  everything 
else  in  the  distance  is  proportional  to  the  object  being  identified.  This  phe¬ 
nomenon  is  also  well  documented  »n  elementary  psychology  texts.  It  is  the 
adjustment  intended  for  Ig^j. 
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e.  Atmosphere,  Distance.  Image,  and  Obscurants.  As  discussed  in  4c,  ob¬ 
scurants  can  cause  a  subset  of  the  subportions  of  the  image  lad  to  be  partially 
obscured.  Thus,  for  a  given  distance  d  and  an  obscurant  effect  p 

Cadi  •  t  -»Xpadi  9padi  •  ^  padi  Ipadi  9  ^pad 
"a 

As  before,  if  U  Ipadi  -  Ipad»  identification  of  a  partially  obscured  object 
i=l 

has  occurred. 

This  last  model  is  the  most  general  and  takes  into  account  the  dimensions  men¬ 
tioned  in  this  paper.  In  all  of  these  models  so  far  motion  has  not  been  included 
and  these  models  represent  an  object  sensed  at  a  fixed. distance  from  the  sensor. 
However,  motion  does  affect  one's  perception  of  what  is  displayed  on  the  screen 
of  a  thermal  sight  and  will  cause  the  observer  to  become  aware  of  something  new 
appearing  on  the  screen  as  well  as  continuously  changing  its  position.  This 
change  in  gestalt  included  with  a  unique  signature  causes  objects  to  be  detected 
and  identified  simultaneously,  which  is  not  predicted  from  equations  (1)  and  (2). 
Equations  (1)  and  (2)  predict  a  more  gradual  transition  through  the  acquisition 
process,  whereas  motion  and  signature  allow  the  observer  to  accomplish  the 
acquisition  process  from  detection  to  identification  almost  instantaneously, 
and,  therefore,  at  the  same  apparent  range.  This  has  been  demonstrated  by 
several  studies  of  which  the  author  is  aware  (1,  2,  and  3). 

5.  Summary.  Target  acquisition  by  a  thermal  sight  as  presently  modeled  through 
the  use  of  such  equations  as  (1)  and  (2)  requires  revisiting.  These  models 
predict  a  gradual  transition  through  the  acquisition  process.  However,  actual 
tests  by  troops  with  moving  targets  indicate  that  real  targets  and  not  ones 
that  have  been  made  to  appear  like  a  target  are  identified  at  the  same  time 
that  they  are  detected.  Since  the  degree  of  target  acquisition  is  related  to 
the  command  to  fire,  this  would  imply  that  opening  engagements  might  take 
place  earlier  and  at  greater  ranges  than  would  be  indicated  through  the  use 
of  equations  such  as  (1)  or  (2).  Inasmuch  as  the  models  presented  in  this 
paper  would  be  difficult  to  convert  into  mathematical  formulas,  until  such 
equations  are  developed,  a  substitute  such  as  actual  data  found  in  such  studies 
as  1,  2  and  3  could  be  converted  into  look-up  tables  to  determine  acquisition 
as  a  function  of  range,  atmosphere,  obscurants  and  image  to  determine  when  a 
target  was  acquired.  Moreover,  when  necessary,  interpolation  of  these  tables 
could  be  used  to  provide  the  needed  continuum  for  use  in  a  computer  model.  The 
use  of  such  information  could  be  used  to  advantage  in  assisting  modelers  in 
discriminating  between  systems  with  and  without  thermal  sights  as  well  as  for 
other  modeling  purposes. 


1.  USAARENBD,  Tank  Company  Team,  Night  Fight  Test,  1976. 

2.  USAARENBD,  Tank  Infrared  Elbow  (TIRE)  FDTE,  1976. 

3.  USAARENBD,  Tank  Thermal  Sight  (TTS),  OTII,  1978. 
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ABSTRACT 

Productivity  of  the  industrial  robots  are  directly  related  to 
the  speed  of  the  task  execution*  The  speed  of  the  robots  can  be 
drastically  improved  by  using  better  control  algorthims  and 

reducing  the  weight  of  the  manipulator. 

The  speed  of  a  robotic  manipulator  is  constrained  by 
manipulator  dynamics  and  actuator  capabilities.  Increasing  the  size  of 
the  actuators  is  not  a  solution  since  that  will  increase  the  weight  of 
the  the  overall  system  leading  to  a  relatively  heavier  system. The 
more  realistic  approach  to  the  problem  is  to  find  the  optimum 
control  solution  for  a  manipulator  to  follow  a  pre-defined  path  in 
minimum  time,  with  limited  actuator  capabilities. 

In  terms  of  the  dynamic  constraints,  the  weight  of  the  arms 
may  be  the  most  important  factor.  If  a  light-weight  arm  structure  is 
used,  actuators  will  be  able  to  afford  higher  speeds  during  the  task 
execution  than  they  would  for  rigid  arm  structure*  On  the  other  hand 
using  flexible-arms  has  a  major  draw*back  which  is  the  flexible 
vibrations,  while  increasing  the  speed* 

This  paper  presents  the  minimum  time  control  solution 
of  a  two  link  flexible  arm  with  actuator  constraints  .  We  solved 
the  minimum  time  problem  with  no  constraints  on  the  flexible  modes 
and  show  the  time  improvement  due  to  the  use  of  light-weight  arms* 

The  objective  is  to  modify  the  trajectory,  such  that  flexible  vibrations 
are  bounded  while  changing  the  solution  from  the  previous  one  as  little 
as  possible.  Practical  ways  of  trajectory  modifications  for  flexible  arms 
are  discussed. 

INTRODUCTION 

Today,  most  trajectory  planning  algorithms  do  not  consider  the  dynamics 
of  the  manipulators,  rather  constant  and/or  piece  wise  constant 
accelerations  for  the  overall  task  are  used  and  an  overall  maximum 
allowable  speed  is  set  [5,6,7].  However,  robotic  manipulators  are 
highly  nonlinear  dynamic  systems,  so  it  is  expected  that  affordable 
accelerations  and  decelerations  and  maximum  speeds  will  vary  as  a 
function  of  states.  For  the  traditional  schemes  to  work,  the  trajectory 
must  be  planned  for  the  worst  possible  case.  The  capabilities  of  the 
system  will  be  used  only  a  small  part  of  the  time*  Bobrow  et.al*  [l] 
first  reported  that  for  every  point  on  the  path  there  is  an  associated 
maximum  allowable  speed  and  maximum  affordable  acceleration  and 
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deceleration,  and  these  values  can  drastically  vary  from  one  state  to 
another.  Incorporating  the  ^nanipulator  dynamics  into  the  trajectory 
planning  level,  they  found  the  minimum  time  trajectories  for  different 
manipulator  models  [1,2]  with  limited  actuator  capabilities  moving  along 
pre-defined  paths.  Shin  and  McKay  [3]  solved  the  same  problem  independently. 

Light'-weight  manipulators  with  the  same  actuator  capabilities  will  be 
faster.  The  main  problem  associated  with  the  light-weight  structures 
is  the  flexible  vibrations.  Fig.  1  conceptually  shows  the  performance 
improvement  in  terms  of  increased  speed. 

In  this  paper  we  show  the  performance  improvements  due  to 

1,  use  of  light-weight  arms 

2,  incorporating  the  manipulator  dynamics  into  trajectory  planning  level 

3,  Discuss  flexible  vibrations  during  a  minimum  time  trajectory  execution 
and  considerations  of  path  modifications  such  that  flexible  vibrations 
will  be  bounded.  This  problem  is  similar  in  nature  to  the  one  raised  by 
Hollerbach  [8] - 

FLEXIBLE  MANIPULATOR  DYNAMIC  MODEL  IN  JOINT  AND  PATH  VARIABLES 
A  general  dynamic  modelling  technique  for  flexible  robotic  manipulators 
was  developed  by  Book  using  recursive  Lagrangian-assumed  modes  method. 
Homogeneous  transformation  matrices  are  used  for  kinematic  relations  of 
the  system  [4] .  A  two  link  flexible  robotic  manipulator  is  modelled 
using  that  technique  (Fig.  2).  In  the  model  no  actuator  dynamics  is 
considered,  rather  the  net  torque  input  to  the  links  is  considered  as  the 
input  variable.  No  friction  at  joints  nor  in  the  structural  vibrations  is 
considered*  Flexibility  of  each  link  is  approximated  with  one  assumed 
mode  for  each  link.  The  dynamic  model  of  the  manipulator  may  be  expressed 
in  general  terms  as  : 

CJ]4x4  q=f(q,q)  +  Q  (2-1) 

where  ^ 

rJ '  rpi  a  X  X  ~\  Joint  angle  and  flexible  mode  time  variables 

q. 

T  -I  Net  input  torques 

g':  [T,j2_0,0] 

Generalized  Inertia  Matrix  .symmetric, 

[J]  positive  definite. 

Nonlinear  dynamic  terms  including  centrifugal 
gravitational , effective  spring  and  Coriolis. 

The  problem  is  to  find  the  minimum  time  trajectories  for  a  given 
manipulator  with  limited  actuator  capabilities  moving  along  a  fixed 
path,  with  state  constraints  (bounded  flexible  vibration  constraint). 

Once  the  path  to  be  moved  along  is  specified 

S-S(x,y)  (2-2) 

From  inverse  kinematic  formulation,  the  corresponding  joint  angles  can 
be  found  as 

e=e  (s)  ,  [°1  ,'■’2,]  (2-3) 

Similarly,  once  the  speed  along  the  path  is  known  S(S) 
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0=0  (s,s) 


(2-^) 


and 


e=e  (s,s,s) 


(2-5) 


Knowing  the  relations  (2-3)-(2-5)  analytically  form  or  numerically  the 
manipulator  dynamics  in  part  can  be  expressed  in  path  variables. 


Cii (s,c) 


s  = 


2x1 


'1 

^2 

-1 


- 1 

Or 

_ t 

’  ^^33 

^^34 

-  ^43 

J44  - 

L 

3-g3+h^(s,s,e^) 
L  f4"g4+h2(s,s,e^) 


where  f  .  =f  .  ( s , S . 6 , 6  ) 
-  gi=gi(s,s,e^ 


Czi (s,s,6,6,e^^e^^p) 

C22(s,s,6,5,e^^e^^p) 

(2-6a) 

(2-6b) 

(2-7) 
(2-8) 
(2-9) 


^n:  tangent  and  normal  vectors  along  the  path. 

„  ,  Curvature  of  the  path  at  a  point. 

Notice  that  flexible  modes  also  affect  the  position  of  the  end  effector, 
but  are  not  included  in  the  definition  of  the  path.  This  is  mainly  due 
to  the  fact  that  we  do  not  have  a  "direct"  control  on  the  flexible 
vibrations  and  would  like  to  keep  them  as  small  as  possible  in  general. 


FORMULATION  OF  THE  NEAR  MINIMUM  TIME  TRAJECTORY  PROBLEM  FOR  FLEXIBLE 
MANIPULATORS 

Using  the  classical  variational  calculus  principles,  the  optimum  control/ 
programming  problem  may  be  stated  as: 


Minimize 


r  r 

=  1  dt=  I  ^ 

0  so 

S(So)  =  So 


(3-1) 


S.(s^)  -  Initial  and  final  states  in  path  variables. 

Subject  to  : 

System  dynamics,  equations  (2-6a)  and  (2-6b) 


Actuator  constraints 

(e,0)  <  T.  <  T.  (0  6)  i  =  1  ,2 
n'n  ■  "  ■  ■  >- 

na^nic  inequality  constraints  on  flexible  modes 

a.(t)<  6^(t)  <  b.(t)  i=l,2 
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(3-2) 


(3-3) 


The  constraints  (3“3)  naturally  arise  in  flexible  structures. 

If  such  a  constraint  is  not  imposed  there  is  no  guarantee  on  the 
accuracy  of  the  end  point  along  the  path.  At  first  the  problem  will  be 
solved  without  considering  these  constraints.  This  solution  will  be 
used  as  a  nominal  solution  for  the  trajectory  modification  step  so  that 
(3^3)  are  satisfied. 

The  solution  method  we  use  closely  follows  Bobrow  et.al.'s  method  with 
some  modifications  for  flexible  manipulators.  The  solution  of  the 

above  stated  optimization  problem  follows:  for  any  path  S(x,y)  with 
given  So  (So ) » (S^ )  to  minimize  J,  S  should  be  as  large  as  possible 
while  satisfying  the  system  dynamics  and  actuator  constraints.  In 
order  to  do  so  at  any  state  on  the  path  one  should  use  maximum 
acceleration  or  deceleration.  Then,  the  problem  is  reduced  to  finding 
the  maximum  accelerations  and  decelerations  associated  with  each  state 
of  interest.  It  can  be  seen  from  equation  (6a>  that  for  each  (S  ,S  ) 

S  .  <  S  <  S,  <3-A) 

d  -  -  a 

=  min 

a 

Sj  =  max 
d 

Obviously  there  may  be  some  range  of  speeds  associated  with  every  point 
on  the  path  that  system  can  no  longer  afford  to  satisfy  all  conditions 
(the  S  range  that  above  inequality  is  violated).  Collection  of  these 
ranges  defines  the  forbidden  region  on  (S  ,S  )  plane.  The  boundary 
between  allowed  and  forbidden  regions  is  constant  for  a  given  rigid 
manipulator  for  a  given  task.  In  the  case  of  flexible  manipulators,  due 
to  the  coupling  between  equations  (6a)  and  (6b)  this  boundary  is  also  a 
function  of  flexible  modes, not  only  (S  ,S  ),  So,  depending  on  the  time 
history  of  flexible  modes  and  unpredictable  disturbances  the  boundary 
will  vary.  This  is  not  true  in  the  rigid  case  where  the  true  extremum 
can  be  found.  At  this  point  the  problem  is  to  find  when  to  use  maximum 
accelerations  and  when  maximum  decelerations  (i,e,  to  find  the 
switching  point  (s)) .See  Fig.  3a“3b. 

Finding  switching  points  for  flexible  manipulators: 

1. Integrate  S=S(x,y)  from  final  state  backward  in  time  until  it 
crosses  forbidden  region  or  initial  position,  using  maximum 
deceleration* 

2,  Integrate  §(x,y)  Forward  in  time  with  maximum  acceleration  until 
the  boundary  is  reached  or  the  two  curves  crossed  each  other.  If  the 
two  curve  crossed  each  other  before  they  enter  forbidden  region,  then 
find  that  point.  This  is  the  last  switching  point  and  terminate  the 
search*  If  not,  then 

3.  Backup  on  the  forward  integrated  curve  and  integrate  forward  with 
maximum  deceleration  until  a  the  trajectory  passes  tangent  to  the 
boundary. 

A. Then  using  the  point  as  new  starting  point  go  to  step  two. 

Notice  that  the  last  switching  point  is  not  the  exact  switching  points, 
because  the  flexible  modes  will  not  match  at  this  point.  That  will 
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cause  one  to  miss  the  final  state  somewhat*  Also,  when  searching  for 
the  switching  points  one  has  to  move  in  a  continuous  manner  in  order  to 
keep  track  of  the  flexible  mode  histories  accurately.  In  that  sense, 
the  algorithm  given  at  [l]  has  been  modified  for  flexible  robotic 
manipulators . 


SIMULATION  RESULTS  AND  DISCUSSION 

The  two^link  flexible  manipulator  model  for  task  one  (shown  in  Fig,  Aa) 
was  simulated  for  the  two  different  cases  in  order  to  show  the 
performance  improvement  achieved  due  to  light”weight  system.  In  both 
cases  actuators  have  same  capabi 1 i t ies .  It  is  found  that  weight 
reduction  by  a  factor  of  2  results  in  approximately  60  7,  time  improvements 
(Fig.  5a  and  6a),  This  improvement,  of  course,  slightly  varies  depending 
on  the  task.  Joint  actuator  histories  are  shown  in'Fig.  5b-6c  and  flexible 
mode  responses  are  shown  in  Fig.  5c-6d. 

Task  2  (Shown  in  Fig,  4b)  simulated  for  light-weight  manipulator  and 
results  are  shown  Fig  7  a—d.  The  final  trajectory  is  shown  in  heavy 
lines.  One  interesting  point  in  this  simulation  is  the  fact  that  as 
soon  as  the  manipulator  end  point  enters  the  curvature  the  system  must 
accelerate  along  the  path  in  order  to  obey  the  constraints.  In  Fig,  5a 
the  curve  ab  shows  that  right  before  the  curvature  the  system  is  able 
to  afford  deceleration  (aa'  curve),  but  as  end  point  enters  the 
curvature,  then  the  sudden  appearance  of  a  normal  acceleration  term  in 
the  dynamics  of  the  system  makes  the  di f f er ence , The  other  point  in 
the  case  of  flexible  arms  is  that  at  the  last  switching  point  flexible 
modes  are  not  same, since  thay  have  different  histories.  This  will  cause 
error  in  the  final  state  reached.  See  Fig.  6a,  7a-  The  last  switching 
point  needs  to  be  varied  from  the  original  result  of  the  above  algorithm. 
This  can  be  done  on  trial  and  error  basis  at  the  trajectory  planning  level. 

5. CONCLUSION  AND  FURTHER  WORK 

In  this  paper  we  showed  ways  to  improve  performance  and  productivity 
of  Robotic  manipulators  With  Flexible  arms. One  way  was  to  use 
light-weight  structures  and  the  other  was  to  incorporate  the  dynamics 
of  manipulators  in  to  trajectory  planning  level  and  make  optimum 
utilisation  of  given  manipulator.  This  method  can  be  used  with  any 
path* Application  of  the  method  requires  manipulator  model , Geometric 
path  in  work  space, and  actuator  capabilities.  Obvoiusly  as  trajectory 
gets  closer  to  the  forbiden  region  boundary  system  capabilities  are 
being  used  to  the  limits  and  any  disturbance  or  uncertainty  can  easliy 
put  the  system  into  forbiden  region  .The  situation  is  more  dramatic 
for  flexible  manipulators «Whi le  this  analysis  is  nice  in  terms  of 
knowing  the  ultimate  capabilities  ,  in  practice  there  will  be  a  saftey 
factor  that  will  require  to  keep  the  optimal  trajectory  away  from  the 
forbideen  region  certain  amount . Research  is  in  progress  on  the  Optimum 
modification  of  the  trajectories  found  by  above  described  method  so 
that  inequality  constraints  on  the  flexible  modes  will  be  satisfied. 
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Fig.  5.C  Flexible  modes  time  variables 


Fig.  G.a  Trajectory  of  lightweight  arms. 


Find  Switch  P. 


Fig.  6.b  Finding  the  switching  point 
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Fig.  6.C  Torque  histories  of  lightweight  arms 
along  path  1, 


Fig.  7b.  Trajectory  for  path  2. 


DYNAMICS  OF  FLEXIBLE  MECHANICAL  SYSTEMS 


Wan  S.  Yoo  and  Edward  J.  Haug 
Center  for  Computer  Aided  Design 
and 

Department  of  Mechanical  Engineering 
College  of  Engineering 
The  University  of  Iowa 
Iowa  City,  Iowa  52242 


ABSTRACT.  A  finite  element  based  method  is  developed  and  applied  for 
geometrically  nonlinear  dynamic  analysis  of  flexible  systems.  Vibration 
and  static  correction  modes  are  used  to  account  for  linear  elastic 
deformation  of  components.  Boundary  conditions  for  vibration  and  static 
correction  mode  analysis  are  defined  by  kinematic  constraints  between 
components  of  a  system.  Constraint  equations  between  flexible  bodies  are 
derived  and  a  Lagrange  multiplier  formulation  is  used  to  generate  the 
coupled  large  displacement-small  deformation  equations  of  motion.  A 
standard,  lumped  mass  finite  element  structural  analysis  code  is  used  to 
generate  deformation  modes  and  deformable  body  mass  and  stiffness 
information.  An  intermediate-processor  is  used  to  calculate  time- 
independent  terms  in  the  equations  of  motion  and  to  generate  input  data 
for  a  large  scale  dynamic  analysis  code  that  includes  coupled  effects  of 
geometric  nonlinearity  and  elastic  deformation.  Two  examples  are 
presented  and  the  effects  of  deformation  mode  selection  on  dynamic 
prediction  are  analyzed. 

I.  INTRODUCTION.  Developments  in  multibody  dynamics  have  come  from 
two  principle  sources;  mechanism  dynamics  and  spacecraft  dynamics.  Early 
developments  in  the  field  of  flexible  mechanisms  are  typified  by  Refs.  1- 
5.  While  numerous  technical  differences  exist  between  the  methods  used  in 
these  studies,  the  pervasive  assumption  made  is  that  large  amplitude  gross 
motion  dynamics  can  be  uncoupled  from  small  amplitude  elastodynamics  of 
the  system. 

Attempts  have  been  made  to  account  for  coupling  between  gross  motion 
and  elastodynamics  using  continuum  models  and  deriving  special  purpose 
equations  of  motion.  A  mechanism  example  that  typifies  this  approach  is 
presented  in  Ref,  6.  A  related  approach  to  spacecraft  dynamics  may  be 
found  in  Refs.  7-9, 

Essential  limitations  associated  with  the  assumptions  of  uncoupling 
gross  motion  and  vibration,  using  elastic  continuum  models,  has  led 
workers  in  both  fields  of  flexible  mechanism  dynamics  and  spacecraft 
dynamics  to  formulations  that  employ  finite  element  based  techniques  to 
represent  flexibility  and  coupling  these  effects  with  gross  motion 
dynamics.  Bodley  and  co-workers  developed  a  computer  program  called 
DISCOS  [lOj  for  spacecraft  dynamics  and  control,  including  flexibility 
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effects  through  finite  element  modal  analysis.  Related  developments  in 
spacecraft  structural  dynamics  with  open  loops  may  be  found  in  Refs,  11 
and  12.  A  substantial  extention  of  these  methods,  using  relative  joint 
coordinates  and  accounting  for  closed  loops,  has  recently  been  presented 
by  Keat  [13]. 

Finite  element  based,  fully  coupled  dynamic  formulations  for  machine 
dynamics  began  in  the  late  1970 's  and  are  reflected  by  the  early  papers  of 
Refs.  14  and  15.  Song  [14]  presented  a  general  formulation  for  planar 
system  dynamics  that  incorporated  flexible  finite  elements  into  a  general 
purpose,  rigid  body  mechanism  dynamics  code.  On  the  other  hand,  Sunada 
and  Dubowsky  [15]  used  a  lumped  mass  finite  element  model  of  structural 
components  and  developed  the  equations  of  motion  for  selected  machines. 
Their  work  provided  substantial  insight  into  use  of  finite  element  data  in 
generating  the  equations  of  motion.  Shabana  and  Wehage  [16]  extended  the 
method  of  Ref.  14,  to  include  vibration  modes  to  model  flexible  bodies  in 
a  general  purpose  dynamics  code.  The  key  contribution  from  this  work  was 
clear  demonstration  that  flexibility  effects  can  be  included  in  a  general 
purpose  spatial  dynamics  code.  The  limitation  of  the  approach  of  Ref.  16 
is  that  flexible  bodies  must  be  made  up  of  collections  of  finite  elements 
that  are  imbedded  in  the  analysis  code. 

The  purpose  of  this  paper  is  to  present  and  illustrate  use  of  a 
finite  element  based  numerical  method  for  dynamic  analysis  of  mechanical 
systems  that  contain  complex-shaped,  flexible  bodies.  To  achieve  this 
goal,  the  following  approach  is  employed; 

(1)  Vibration  and  static  correction  deformation  modes  are  used  to 
define  kinematically  admissible  deformation  fields  in  elastic 
components. 

(2)  Constraint  equations  between  flexible  bodies  are  derived. 

(3)  The  equations  of  motion  are  derived,  using  lumped  mass  finite 
element  approximations. 

(4)  A  standard  finite  element  structural  analysis  code  is  used  for 
vibration  and  static  correction  mode  analysis  of  each  flexible 
body.  A  preprocessor  calculates  time-independent  terms  in  the 
equations  of  motion  and  generates  input  data  for  geometrically 
nonlinear  dynamic  analysis  of  flexible  systems. 

II.  G^ERALIZED  COORDINATES  AND  KINETIC  ENERGY.  In  order  to  specify 
the  state  of  a  body,  it  is  necessary  to  define  a  set  of  generalized 
coordinates  that  uniquely  locate  every  point  in  the  body  in  space.  The 
XYZ  reference  frame  shown  in  Fig.  1  represents  an  inertial  reference  frame 
and  the  xyz  frame  is  a  reference  frame  for  a  typical  body,  which  need  not 
be  fixed  to  the  body  as  it  deforms.  The  xyz  frame  is  defined  to  be  fixed 
to  the  body  in  its  undeformed  state;  i.e.,  points  on  the  body  can  be 
defined  relative  to  the  xyz  frame,  prior  to  deformation.  If  the 
distributed  mass  around  node  i  is  replaced  by  a  point  mass  at  that  node, 
which  is  called  lumped  mass,  that  mass  has  no  rotary  inertia. 
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X 

Figure  1.  Reference  Generalized  Coordinates 

Consider  node  point  i  on  the  body,  defined  by  the  vector  in  the 

xyz  frame  in  the  undeformed  state.  Let  be  a  projection  matrix  that 

extracts  the  xyz  coordinates  of  the  displacement  u"*  of  point  i  due  to 
deformation;  i.e., 

u^’  P^  u  (1) 

where  u  is  the  nodal  displacement  vector  for  the  entire  body,  which 
contains  both  nodal  displacements  and  rotations.  After  deformation,  point 
i  is  located  in  the  xyz  frame  by 

r^  *  rj  +  u’  =  r j  +  P%  (2) 

where  r^  and  are  vectors  from  the  origin  of  the  xyz  frame  to  point  i 

in  the  deformed  and  undeformed  states  respectively.  Global  XYZ 
coordinates  of  point  i  are  thus 

=  R  +  A(rQ  +  P%)  (3) 

where  A  is  the  transformation  matrix  from  the  xyz  frame  to  the  XYZ  frame, 
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In  order  to  uniquely  characterize  the  state  of  the  body,  in  terms  of 
generalized  coordinates  of  the  reference  frame  and  deformation  mode 
coordinates,  one  must  exclude  modes  corresponding  zero  frequencies. 
Otherwise,  rigid  body  motion  is  represented  by  both  R  and  the  modal 
coordinates  and  uniqueness  of  state  as  a  function  of  generalized 
coordinates  is  lost. 

In  terms  of  deformation  mode  coordinates,  one  can  calculate  the 
velocity  of  point  i  from  Eq.  3  as 

=  ft  +  A(rQ  +  pSa)  +  AP^'ya  (4) 

where  f  and  a  are  modal  matrix  and  modal  coordinate  vector, 
respectively.  Using  Euler  parameters  [17]  (See  Appendix)  as  rotational 
generalized  coordinates  of  the  xyz  frame,  relative  to  the  XYZ  frame,  the 
absolute  velocity  becomes 

ft^  =  ft  -  2Ef^p  +  AP^fa 

=*  ft  -  2Ar^Gp  +  AP^Ya  (5) 

where  p  is  the  (4x1)  vector  of  Euler  parameters,  r^  is  any  (3x1)  vector 

defined  in  the  xyz  frame,  r^  and  are  (4x4)  and  (3x3)  matrices  composed 

of  the  elements  of  r^,  and  E  and  G  are  (3x4)  matrices  defined  by  the 
elements  of  p  (see  Appendix  for  details).  Summing  the  kinetic  energy  of 

masses  m^  lumped  at  nodes  i  over  the  total  number  N  of  nodes  yields  the 
kinetic  energy  of  the  body  [18]  as 


(6) 


where 
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(I  mOlo  -2E  ^  m.r^ 

i=l  ^  i=l  ' 


M  = 


T  N  .T  . 
4G^(  lf,.r^  r^]G 
i=l  ^ 


symmetric 


N 

A  I  m.P^'r 
i=l  ' 


-2  I  m.G^r^  p’’^ 
i=l 


+  Zp  I  m.r^  P^i* 

i=l  ^ 

I  m.(P^f)^(pV) 

i*l  I 


Note  that  vector  r^  appearing  in  the  mass  matrix  depends  on  both  reference 
and  modal  coordinates,  so  it  is  not  constant.  Evaluation  of  M  at  each 
time  step  of  a  simulation  requires  expansion  of  terms  that  involve 
generalized  coordinates.  An  efficient  method  of  carrying  out  the 
expansion  and  evaluating  M  is  presented  in  Refs.  18  and  19. 

III.  STRAIN  ENERGY.  The  strain  energy  of  a  deformable  body  is 
calculated  in  the  finite  element  code  as 

U  =  2  u^Ku  (7) 

where  K  is  the  structual  stiffness  matrix.  Using  modal  matrix  and 
coordinates,  Eq.  7  becomes 

U  =  2®^'l'^KWa  =  ja^K^^a  (8) 


where 


aa 


IV.  GENERALIZED  FORCE.  Let  be  an  external  force  acting  on  node  i 
of  a  body.  The  virtual  work  of  all  such  forces  acting  on  the  body  can  be 
written  as 


6W  = 


N  .1  . 

I  6R^ 
i=l 


(9) 


The  total  differential  of  Eq.  3  can  be  used  to  calculate  the  virtual 
displacement  of  node  i  as 

6R^  =  6R  -  2Ar^G6p  +  AP'''l'<Sa  (10) 

Equation  9  can  now  be  written  as 
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(11) 


where 


6R 

6p 

5a 


N  . 

=  ,-.v 


T-'ipT^I 


Qp  =  2G'r'A'F 


N  .  T  . 

Qa  "  .1  (AP  ’f) 


i=l 


(12) 


Forces  due  to  spring,  damper,  and  actuator  elements  are  included  in  the 
generalized  force  vector  Q. 

V,  EQUATIONS  OF  MOTION  OF  A  FLEXIBLE  BODY.  The  equations  of  motion 
of  a  flextble  body  with  no  kinematic  constraints  can  be  written  as 


—  —1  - 
3q 


•3Ti 


(11)' 


'■aq-' 


+  J'X  =  Q 


(13) 


where  T  is  kinetic  energy  of  the  body,  q  =  [R^  ,  p^,  a^]^,  U  is  total 
strain  energy  of  the  body,  Q  is  generalized  force  acting  on  the  body,  J  is 

the  Jacobian  matrix  of  the  Euler-parameter  constraint  equation  p^p  =  1 
[17],  and  X  is  a  Lagrange  multiplier  associated  with  that  constraint. 


Expanding  the  kinetic  energy  expression  of  Eq.  6  and  using  the  strain 
energy  expression  of  Eq.  8,  the  equations  of  motion  of  the  body  are 
formulated  [18].  In  these  equations  of  motion,  many  calculations  are 
required  to  evaluate  all  terms.  If  terms  are  partitioned  into  time- 
dependent  and  time-independent  subsets,  the  amount  of  calculation  is 
substantially  reduced  [18].  Time-independent  terms  are  precalculated  in  a 
subroutine  that  takes  input  from  an  established  finite  element  structual 
analysis  code.  The  resulting  values  are  then  read  into  the  geometrically 
nonlinear  dynamics  code  to  form  the  equations  of  motion  (Eq.  13)  of  the 
body.  For  a  detailed  development  of  the  flexible  body  equations  of  motion 
and  a  discussion  of  methods  used  to  generate  deformation  modes  using  a 
finite  element  code,  the  reader  is  referred  to  Refs.  18  and  19. 
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VI.  EQUATIONS  OF  MOTION  OF  A  CONSTRAINED  SYSTEM,  System  equations 
of  motion  can  be  obtained  by  combining  equations  of  motion  of  each  body 
and  equations  of  constraint  between  bodies,  using  Lagranges  equations  of 
motion,  as 


d 

dt 


/ii\^ .  /ilf  +  mV  + 
Va^j  W  VV 


fx 


(14) 


where  T  is  total  kinetic  energy  of  the  system,  q  is  the  complete  set  of 
generalized  coordinates  for  the  system,  □  is  the  strain  energy  of  the 
system,  Q  is  the  complete  vector  of  generalized  forces,  J  is  the  Jacobian 

matrix  of  the  kinematic  constraint  equations  (including  p^p  =1  for  each 
body),  and  X  is  the  complete  vector  of  Lagrange  multipliers.  This  system 
of  equations  can  be  systematically  assembled  from  Eq.  13  for  each  body  and 
the  constraint  equations  derived  in  Section  7. 

VII.  CONSTRAINT  FORMULATION.  Kinematic  constraints  between  flexible 
bodies  must  account  for  deformation  of  bodies  that  are  connected. 
Otherwise,  derivation  of  constraint  equations  proceeds  as  in  the  case  of 
rigid  body  systems.  In  this  paper,  the  equations  for  spherical, 
universal,  and  revolute  joints  are  formulated  for  rigid  and  flexible 
bodies. 


(a)  Spherical  Joint  (SPHR).  A  spherical  joint  at  point  P  between 
two  adjacent  bodies  i  and  j  is  shown  in  Fig,  2.  A  vector  equation  that 
requires  point  P  to  be  common  to  both  bodies  is 

R,  t  A,s;  .  Rj  -  AjSj  =  0  (15) 

where  s!  and  s',  are  vectors  from  the  origins  of  the  ith  and  jth  local 
^  J 

reference  frames  to  point  P.  Let  si  and  s',  be  values  of  si  and  si  in 

ig  Jq  1  j 

the  undeformed  state,  measured  in  the  respective  body  reference  frames. 


If  both  bodies  are  flexible,  si  and  si  depend  on  elastic  deformation 

*  U 

and  must  be  evaluated  at  each  deformed  state.  If  modal  coordinates  are 
employed  to  represent  elastic  deformation,  Eq,  15  can  be  written  as 


R.  +  Ajsl  +  (P'^'ya)  J 

1  1  Iq  1 


[si  +  (P'^'ya).]  =  0 
Jq  j 


(16) 


where  A.,-  and  Aj  are  transformation  matrices  from  the  ith  and  jth  local 

reference  frames  to  the  global  frame  and  k  is  the  number  of  the  node  at 
which  the  spherical  joint  is  located.  This  joint  generates  three 
algebraic  constraint  equations. 

(b)  Universal  Joint  (UNIV).  A  universal  joint  between  adjacent 
bodies  i  and  j  is  shown  in  Fig.  3.  Three  points,  P  in  both  bodies,  I  in 
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Figure  2.  Spherical  Joint 
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body  i,  and  J  in  body  j  are  chosen  to  define  a  universal  joint.  Point  P 
must  satisfy  the  spherical  joint  equation  of  Eq.  16.  The  constraint  that 
requires  vectors  g^-  and  gj,  defined  in  Fig.  3,  to  be  perpendicular  is  that 

their  scalar  (dot)  product  is  zero,  denoted  as  the  DOTl  constraint, 

g/9j  ■  0  (17) 

where  g^  and  g^  are  vectors  from  point  P  to  points  I  and  J  in  the  global 
frame. 


When  both  bodies  are  flexible,  elastic  deformation  should  be 
considered.  To  include  the  effect  of  rotational  deformation,  define  body- 
fixed  and  frames  at  point  P  in  each  body.  It  is  assumed 

I  I  I  J  J  J 

that  the  structure  near  the  joint  is  sufficiently  stiff  so  that 
vectors  gV  and  g'l  defined  in  the  and  frames  remain 

1  J  II  \J  \l  V 

constant.  Then,  Eq.  17  can  be  written  as 

(A.B.gJ)^(AjBjgj!)  =0  (18) 

where  and  A^-  are  transformation  matrices  from  to  x^-y^z^-  frames 

and  from  x^y^z^-  to  XYZ  frames,  respectively.  The  same  notations  are  used 
in  defining  matrices  B.-  and  A^  for  body  j. 


(c)  Revolute  Joint  (RVLT).  A  revolute  joint  between  adjacent  bodies 
i  and  j  is  shown  in  Fig.  4.  Three  points,  P  in  both  bodies,  I  in  body  i, 
and  J  in  body  j,  are  chosen  on  the  common  joint  axes.  This  joint  requires 
that  point  P  should  be  common  to  both  bodies  and  that  vectors  g^-  and  gj 

must  be  parallel.  Hence,  point  P  must  satisfy  the  spherical  joint 
equation  of  Eq.  16. 

Constraint  equations  that  require  vectors  g^  and  gj  in  Fig.  4  to  be 
parallel  are  that  gj  be  orthogonal  to  two  perpendicular  vectors  hj  and  h2 
that  are  imbedded  in  body  i  and  are  perpendicular  to  g^.  Those 
constraints  can  be  formulated  as  two  OOTl  constraints. 


hjg.  =  o'! 


(19) 


VIII.  SELECTION  OF  DEFORMATION  MODES.  In  modal  deformation 
approximate  methods,  a  few  of  the  lowest  frequency  natural  vibration  modes 
are  normally  used.  This  often  requires  a  large  number  of  vibration  modes 
(i.e.,  high  frequency  modes)  to  represent  local  deformation  effects  due  to 
concentrated  loads.  Developments  in  structural  dynamics  [20-22]  suggest 
that  another  kind  of  deformation  mode  should  be  introduced  to  approximate 
the  effects  of  high  frequency  modes. 
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In  mechanical  system  dynamics,  there  are  differences  from 
conventional  structural  dynamics.  Kinematic  constraints  and  nonlinear 
spring-damper  elements  are  frequently  attached  between  two  adjacent 
elastic  bodies  and  small  elastic  deformation  is  highly  coupled  with  the 
geometrically  nonlinear  global  motion  of  the  system.  To  consider  the 
local  deformation  due  to  forces  transmitted  by  these  elements,  deformation 
modes  are  defined  by  imposing  unit  forces  in  the  direction  of  reaction  or 
spring  forces  that  are  expected  to  cause  significant  deformation  of  the 
body.  The  resulting  deformation  is  called  a  static  correction  mode.  A 
detailed  derivation  of  static  correction  modes  for  application  to  machine 
dynamics  can  be  found  in  Ref.  18. 

If  one  chooses  n  vibration  normal  modes  from  vibration  analysis  and 
m  static  correction  modes,  the  elastic  displacement  u  can  be  written  as 
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u 


(20) 


=  a_  + 
n  n  lii  ni 

where  and  are  mode  shapes  from  vibration  and  static  analysis, 
n  m 

respectively,  and  a_  and  a^  are  amplitudes  of  these  modes.  The 

n  m 

matrices  'r_  and  are  combined  to  form 
n  m 

y  »  [«!»_  ¥_)  (21) 

n  m 

IX,  COMPUTER  IMPLEMENTATION.  To  practically  implement  the  theory 
presented  in  this  paper,  a  finite  element  code  with  the  capability  of 
carrying  out  both  vibration  and  static  analysis  is  employed.  Most  finite 
element  codes  can  be  used  for  this  purpose.  The  SPAR  Structural  Analysis 
System  was  used  for  applications  presented  in  this  paper. 

After  vibration  and  static  correction  mode  analysis  is  complete,  the 
modal  matrix  is  composed  of  vibration  normal  modes  and  static  correction 
modes  (Eq.  21).  An  intermediate  processor  is  used  to  generate  modal 
matrices  and  time-independent  terms  in  the  equations  of  motion.  Output 
from  the  intermediate  processor  is  read  as  input  data  in  the  DADS  dynamic 
analysis  code.  Using  the  input  data,  the  DADS  program  formulates  the 
equations  of  motion  of  the  system  (Eq.  14),  which  are  then  integrated 
using  a  variable-step,  variable-order  numerical  integration  algorithm. 

For  additional  details  regarding  implementation,  see  Refs,  18  and  19. 

X.  NUMERICAL  EXAMPLES. 

(a)  Flexible  Component  Door  Closing  Mechanism.  A  flexible  door  is 
attached  to  a  rigid  ground  by  two  beams  and  revolute  joints,  as  shown  in 
Fig.  5,  If  the  rotational  axes  of  the  revolute  joints  are  parallel  to  the 
Z-axis,  the  door  can  freely  rotate.  If  the  revolute  joints  are  not 
parallel,  the  door  must  deform  to  rotate.  In  this  example,  the  rotational 
axes  of  the  revolute  joints  remain  in  the  X-Z  plane  and  make  an  angle  of  5 
degrees  with  the  Z-axis.  The  door  is  initially  rotated  15  degrees.  To 
make  the  body  of  the  door  much  stiffer  than  the  beams,  the  Modulus  of 
Elasticity  of  the  door  plate  is  assigned  to  be  10  times  greater  than  that 
of  the  beams.  Material  properties  and  dimensions  of  the  mechanism  are  as 
follows; 

Mass  density,  p  =  7.83  *  10^  Kg/m^ 

Height  of  the  plate,  H  =  0.8  m 
Width  of  the  plate,  W  =  0.6  m 
Modulus  of  Elasticity  of  the  plate,  E2 
Thickness  of  the  plate,  t  =  0.005  m 
Length  of  the  beams,  L  =  0.05  m 
Modulus  of  Elasticity  of  the  beams,  El 
Radii  of  the  beams,  r  =  0.003  m. 


=  2.0  *  10^2 
=  2.0  *  lO^l  N/m^ 
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Figure  5.  Flexible  Door  With  Initial  Rotation  of  Angle  15° 


The  finite  element  model  of  the  door  is  shown  in  Fig.  6.  The  plate 
is  divided  into  12  membrane+bending  elements  and  each  beam  is  divided  into 
10  beam  elements  of  equal  length.  Boundary  conditions  for  vibration 
analysis  are  chosen  such  that  three  translational  coordinates  at  node  40 
are  fixed  and  X-  and  Z-direction  translational  coordinates  at  node  30  are 
fixed.  Since  only  five  constraints  are  imposed  for  vibration  analysis, 
there  is  one  rigid  body  mode  of  vibration.  The  first  ten  natural 
frequencies  from  vibration  analysis  are  0.0,  62.08,  126.29,  163.53, 

227.95,  237.84,  375.90,  477.91,  499.10,  and  533.48  rad/sec. 
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N5 


I - 1 - . - ifjl 

NI6  Nil  N6 


N  *  * :  Node  Number  *  * 

Plate  ;  12  (Membrane  +  Bending)  Elements 

E  =  2.0  xio'^  N/m^ 

Beam  10  Beom  Elements  ot  each  beam 
E=2.0xl0"  N/m^ 


Figure  6,  Finite  Element  Model  of  Flexible  Door 


Since  the  five  constraints  for  vibration  analysis  are  not  sufficient 
to  suppress  rigid  body  motion,  one  additional  constraint  is  imposed  to 
define  static  correction  modes.  The  Z-direction  translational  coordinate 
at  node  8  is  fixed  as  one  additional  constraint*  Since  a  revolute  joint 
allows  one  relative  rotational  degree  of  freedom  about  its  joint  axis, 
there  are  five  reaction  components  at  each  revolute  joint.  In  this  model, 
however,  five  reaction  components  (2  forces  at  node  30  and  3  forces  at 
node  40)  are  already  accounted  for  in  vibration  analysis.  The  other  five 
reaction  components  (two  torques  at  nodes  30  and  40  and  one  force  at  node 
30)  are  chosen  to  define  static  correction  modes,  as  shown  in  Fig.  7.  At 
node  30,  the  first  static  correction  mode  is  due  to  a  unit  torque  in  the 
Z-direction,  the  second  is  due  to  a  unit  force  along  the  rotational  axis. 
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and  the  third  is  due  to  a  unit  torque  along  the  axis  in  the  X-Y  plane 
perpen-dicular  to  the  rotational  axis.  At  node  40,  the  first  mode  is  due 
to  a  unit  torque  in  the  Z-direction  and  the  second  is  due  to  a  unit  torque 
along  the  axis  in  the  X-Y  plane  perpendicular  to  the  rotational  axis. 


Figure  7.  Forces  Defining  5  Static  Correction  Modes 


To  see  the  vibratory  motion  of  the  door  due  to  an  initial  15  degree 
rotation,  the  door  is  released  from  the  deformed  position.  Analysis  was 
carried  out  with  5N  (5-normal  modes),  5S  (5-static  correction  modes),  9N 
(9-normal  modes),  and  4N5S  (4-normal  +  5-static  correction  modes) 
models.  The  static  correction  modes  in  the  5S  and  4N5S  solutions  are 
inertia  relief  and  residual  inertia  relief  attachment  modes,  respectively. 
Inertia  relief  attachment  modes  are  defined  by  subtracting  a  portion  of 
the  rigid  body  mode  from  the  attachment  modes.  Residual  inertia  relief 
attachment  modes  are  then  defined  by  subtracting  the  contribution  of  the 
kept  normal  modes  from  these  inertia  relief  attachment  modes. 

The  X-coordi nates  of  node  8  in  the  5N  and  5S  solutions  are  shown  in 
Fig.  8.  To  determine  the  reason  for  the  difference  in  frequency  between 
the  5N  and  5S  solutions,  strain  energies  at  the  initially  deformed  states 
are  compared.  The  initial  strain  energy  is  calculated  as  S.E.  = 

(a^K,  a)/2,  where  a  is  the  vector  of  modal  coordinates  at  the  initially 

deformed  configuration  and  Kgg  is  the  modal  stiffness  matrix.  The  initial 

strain  energy  of  the  5N  state  is  57.6  N»m  and  that  of  the  5S  state  is 
0.562  N»m.  Because  of  the  high  initial  strain  energy  in  the  5N  state,  the 
frequency  of  the  5N  solution  is  much  higher  than  that  of  the  5S  solution. 
Considering  the  minimum  potential  energy  principle,  for  the  initial 
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equilibrium  configuration,  it  can  be  seen  that  the  5S  deformed  state  is 
closer  to  the  actual  initial  deformation  than  that  of  5N  state.  Thus,  the 
5S  solution  is  expected  to  be  more  accurate  than  the  5N  solution. 


- •  5N  SOLUTION 

••••*  5S  SOLUTION 


Time  (  sec  ) 


Figure  8.  X-coordi nates  of  Mode  8  in  the  5N  and  5S  Solutions 


Nine  normal  mode  (9N)  and  4N5S  solutions  are  compared  with  the  5S 
solution  in  Fig.  9.  Although  the  number  of  normal  modes  is  increased  to 
nine  in  the  9N  solution,  the  solution  is  still  far  from  the  5S  solution. 
This  means  that  nine  normal  modes  are  not  sufficient  in  this  example. 
There  is  almost  no  difference  between  results  of  the  5S  and  4N5S 
solutions.  This  means  that  the  attachment  modes  are  dominant  in  this 
example. 

Simulation  times  on  a  PRIME  750  computer  and  RMS  (Root  Mean  Square) 
integration  stepsizes  are  given  in  Table  1,  Since  the  frequency  of  the 
normal  mode  solution  is  much  greater  than  that  of  the  5S  solution,  its 
computing  time  is  extreme.  It  is  difficult  to  estimate  the  number  of 
normal  modes  that  would  be  required  for  an  accurate  solution.  Certainly, 
computing  time  would  be  astronomical. 
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coord,  of  node  8 


9N  SOLUTION 


5S  SOLUTION 
4N5S  SOLUTION 


Figure  9,  X-coordi nates  of  Node  8  in  the  9N,  5S,  and  4N5S  Solutions 
Table  1.  Comparison  of  Simulation  Times 


Type 

T  end 

[sec] 

CPU 

[sec] 

RMS  integration 
stepsize 
[sec] 

5S  solution 

2.0 

106 

0.47812E-01 

5N  solution 

2.0 

401 

0.90934E-02 

9N  solution 

2.0 

7471 

0.65035E-03 

4N5S  solution 

2.0 

7281 

0.75491E-03 
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(b)  Windshield  Wiper  Mechanism.  A  model  of  a  two  blade  windshield 
wiper  mechanism  that  consists  of  6  bodies  is  shown  in  Fig.  10.  The  crank 
arm  (body  2),  drive  link  (body  3),  and  connecting  link  (body  5)  are  modeled 
as  rigid  bodies.  In  the  right  wiper  arm  (body  4),  the  link  connecting  bar 
and  wiper  arm  are  modeled  as  rigid  and  flexible,  respectively.  In  the  left 
wiper  arm  (body  6),  the  link  connecting  bar  and  wiper  are  modeled  as  rigid 
and  flexible,  respectively.  The  chassis  of  the  vehicle  is  body  1.  The 
mass  and  moments  of  inertia  of  each  body  are  given  in  Table  2. 


UNIVERSAL  JOINT 
(66.4,14.2,2.8) 


SPHERICAL  JOINT  7  ^BODY  5 
(1.5,10.5,6.4)  5  (CONNECTING 

REVOLUTE  JOINT  LINK)  REVOLUTE  JOINT 

(63.52,7.84,2.9) 


UNIVERSAL  JOINT, 
(-75,2.3,4.7)  23 

Z 

'Y, 


z' 

SPHERICAL 
(40.4,3.6,1.5) 


JOIN 


BODY  2  (CRANK  LINK) 
REVOLUTE  JOINT 
(45.4,1,05,1.2) 


Xl. 


BODY  I  (CHASSIS) 


Figure  10.  Windshield  Wiper  Mechanism 


Table  2. 

Mass  and 

Moments  of 

Inertia 

Body 

Moments  of 

2 

Inertia  (g«cm  ) 

No. 

(gram) 

^xx 

lyy 

Izz 

^xy 

^X2 

‘yz 

2 

61.3 

282.6 

314.7 

33.6 

0.0 

0.0 

0.0 

3 

189.2 

100.0 

36413.8 

36413.8 

0.0 

0.0 

0.0 

4 

567.0 

215730.0 

218240.0 

3610.6 

155.5 

-1737.8 

-4060.0 

5 

255.9 

300.0 

90133.7 

90133.7 

0.0 

0.0 

0.0 

6 

499.5 

198920.0 

202210.0 

4173.8 

-486.6 

10285.0 

-4006.0 
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There  are  three  revolute  joints,  two  spherical  joints,  and  two 
universal  joints  in  the  model.  Locations  of  joints  at  extreme  position 
measured  in  the  chassis  coordinate  system  are  shown  in  Fig.  10.  The  first 
revolute  joint  is  located  between  the  chassis  and  crank  arm.  The  second 
and  third  revolute  joints  are  located  between  the  chassis  and  the  right 
and  left  link  connecting  bars,  respectively.  The  first  spherical  joint  is 
located  between  the  crank  arm  and  drive  link.  The  second  spherical  joint 
is  between  the  right  link  connecting  bar  and  connecting  link.  The  first 
universal  joint  is  located  between  the  drive  link  and  the  right  link 
connecting  bar.  The  second  universal  joint  is  between  the  connecting  link 
and  left  link  connecting  bar. 

A  finite  element  model  of  the  right  and  left  wiper  arms  is  shown  in 
Fig.  11.  This  model  consists  of  26  nodes.  The  link  connecting  bar  (nodes 
1-8)  is  treated  as  rigid  and  the  wiper  arm  (nodes  8-26)  is  treated  as 
flexible.  The  cross-section  of  the  wiper  arm  is  modeled  as  rectangular, 
with  two  different  shapes  along  its  length.  To  consider  the  mass  of  the 
wiper  blade  in  the  finite  element  model,  a  nonstructural  mass  element  is 
attached  at  node  26.  Nonstructural  mass  elements  are  also  allocated 
between  nodes  8  and  14,  to  account  for  variation  of  the  cross-sections. 

To  calculate  the  center  of  mass  of  the  arm,  the  mass  of  the  link 
connecting  bar  is  lumped  at  its  nodes.  To  consider  elastic  deformation  of 
the  wiper  arm,  due  to  frictional  force  at  the  blade,  one  static  correction 
mode  is  defined  and  used  in  flexible  body  analysis.  That  static 
correction  mode  is  defined  by  a  unit  force  applied  at  the  tip, 
perpendicular  to  the  wiper  arm. 

The  friction  force  on  the  arm  tip  is  modeled  as  a  linearly  decreasing 
force  with  increasing  tip  velocity,  as  shown  in  Fig.  12.  Simulations  are 
carried  out  with  a  constant  crank  arm  speed  of  60  rpm.  Tip  velocity  and 
friction  force  from  rigid  and  flexible  solutions  are  presented  as  plots  of 
variables  versus  time  in  Fig,  13.  As  shown  in  Fig.  13,  the  linear  tip 
velocities  in  both  cases  are  different.  In  the  rigid  case,  vibratory 
phenomenon  can  not  be  detected,  but  a  vibratory  frequency  of  13-14  Hz  is 
predicted  in  the  flexible  case. 

XI.  CONCLUSIONS.  This  paper  presents  a  new  method  of  choosing  modal 
matrices  Yor  simuTatTon  of  flexible  mechanical  systems,  employing  static 
correction  and  vibration  modes.  The  following  conculsions  are  obtained. 

(1)  When  high  reaction  forces  occur  at  kinematic  joints,  static 
correction  modes  are  much  better  than  vibration  modes  to 
represent  deformation, 

(2)  If  vibration  and  static  correction  modes  are  combined, 
deformation  due  to  both  inertial  and  reaction  forces  can  be 
accurately  represented. 

(3)  Computing  cost  can  be  significantly  reduced  if  static  correction 
modes  are  employed,  instead  of  purely  normal  modes  of  vibration. 
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NODE  NUMBER 
BEAM  ELEMENTS 


Figure  11.  Finite  Element  Model  of  Wiper  Arms 


Fric.  at  tip  of  right  arm(  N  )  Vel.  of  tip  of  right  arm(m/sec) 


2.5 


0.0  0.2  0.4  0.6  0.8  1.0 


Time  (  sec  ) 


Figure  13.  Velocity  and  Friction  Force  at  Tip  of  Right  Arm 
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APPENDIX 


The  following  matrix  notation  is  used  to  implement  vector  operations 
a  =  Ca^^a^.a^]^ 


a 


-a 


z 


0 


a 


X 


0 


a 


a  X  b  =  a  b 


0 


+ 

a  = 


-a 


a 


T 


a 


Using  Euler's  theorem,  Euler  parameters  [17]  can  be  defined  as 
Bq  =  cos  ^  ,  e  s  n  sin  ^ 

where  x  is  the  rotation  angle  about  a  unit  vector  n  that  moves  the  body 

from  a  reference  orientation  to  a  general  orientation  and 
T  T 

e  =  [6^62,62]  .  Defining  p  =  [60,6^,62*62]  ,  and  has  the  following 
relations: 

T  1  T*  -T  „ 

PP=1  ,  pp  =  pp=0 


E  =  [-e,  e  +  egl]  ,  G  =  C-e,  -e  +  Cq!] 

e’''e  =  G^G  =  -pp^  +  ,  Ga  =  iG  -  ap^ 

Ep  *  Gp  =  0  .  EE^  =  6G^  =  I  ,  EG^  =  A 
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RESPONSE  OF  DAMPED  MECHANICAL  SYSTEMS 
TO  A  TIME  DEPENDENT  EXTERNAL  FORCE 
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ABSTRACT.  The  method  of  Krylov  and  Bogoliubov  is  extended  so  as  to  be 
applicable  to  ordinary  differential  equations  that  describe  the  motion  of 
linearly  damped,  linear  and  nonlinear  mechanical  systems  that  are  subjected  to 
a  particular  family  of  oscillatory  external  forces.  These  force  functions,  in 
turn,  satisfy  a  linear  second  order  ordinary  differential  equation  with  con¬ 
stant  coefficients.  This  method  is  particularly  useful  for  the  resonance 
analysis  of  systems  in  transient  or  steady  motion.  The  technique  is  applied 
to  nonhomogeneous  forms  of  the  differential  equations  of  Mathieu,  Bessel, 
Hermite  and  Duffing  for  the  purpose  of  generating  approximate  analytic  expres¬ 
sions  for  their  solution.  Numerical  computations  based  upon  these  analytical 
approximations  and  the  Runge-Kutta  numerical  integration  technique  reveal  that 
the  method  developed  here  produces  rather  useful  accurate  approximations. 

INTRODUCTION.  The  classicial  method  of  averaging  as  developed  by 
Krylov,  Bogoliubov  and  Mitropolsky  [1],  [2]  has  been  applied  by  many 
investigators  to  solve  linear  and  nonlinear  differential  equations  of  the  type 

X  +  +  £f(t,x,x;)  =  0,  (1) 

which  arise  in  the  theories  of  mechanical  and  electrical  oscillations.  In 
order  to  account  for  the  influence  of  linear  damping,  several  authors  have 
considered  the  differential  equation 
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X  +  Zyk  +  (D^x  +  0f(t,x,X) 


0, 


(2) 


and  have  introduced  certain  modifications  of  the  Krylov-Boyoliubov  theory. 
Various  such  techniques  have  been  reported  by  Brunelle  [3],  Mendelson  [4]  and 
Anderson  [5]  to  [B].  Stanisic  and  Euler  [9]  have  extended  the  Krylov- 
Boyoliubov  method  to  the  specific  non-linear  forced  motion  problem 

X  +  (o^x  +  p^x^  +  u^xx^  =  Ncosflt,  (3) 

which  is  an  equation  that  describes  the  motion  of  a  larye  class  of  nonlinear 
mechanical  systems,  as  reported  by  Kane  [10].  Stanisic  and  Euler  have  not 
accounted  for  the  possible  presence  of  a  linear  dampiny  term  in  (3). 

The  Stani sic-Euler  technique  consists  of  reducing  the  nonhomogeneous , 
nonlinear  equation  in  (3)  to  a  fourth  order  homogeneous,  non-linear  equation. 
In'the  form  of  the  solution  assumed,  there  appear  two  variable  amplitudes  and 
two  variable  frequencies  that  are  to  be  determined. 

In  the  present  investigation,  the  basic  approach  of  the  Stanisic-Euler 
solution  is  extended  to  a  more  general  class  of  problems,  namely, 

X  +  Zy^jX  +  a)|x  +  6f(t,x,x)  =  e0(t),  (4) 

where  Q(t)  is  known  to  satisfy  the  linear  differential  equation 

Q(t)  +  Zy2^i^)  +  =  0.  (b) 

The  initial  conditions  associated  with  (4)  are  taken  to  be 

=  x^,  Jdj  =  (6) 

Furthermore,  it  is  assumed  that  f.<  co^,j  «  1,2,  so  that  the  system  is  under¬ 
damped  in  the  linear  approximation.  The  problem  of  determining  the  response  of 
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dynamic  syst'dns  described  by  (4)  to  (6)  is  of  considerable  importance  for  the 
design  of  nonlinear  elements  in  machines,  mechanisms,  vehicles  and  other 
structures . 


METHUIJ  UF  SOLUTIUN.  The  objective  of  the  analysis  that  follows  is  to 
determine  an  approximate  solution  for  the  initial  value  problem  stated  in  (4) 
and  (b).  The  forcing  function  lj(t)  appearing  in  (4)  is  assumed  to  satisfy  the 
linear  differential  equation  in  (6).  The  first  step  in  the  derivation  of  an 
approximation  for  x(t)  is  the  elimination  of  the  presence  of  Q(t)  in  the 
equation  of  motion.  This  is  done  by  forming  the  first  and  second  derivatives 
with  respect  to  time  t  of  (4)  and  substituting  the  results  into  (b).  This 
process  leads  to  the  following  fourth  order  homogeneous  nonlinear  differential 
equation: 


x'^'^  +  2(y^  +  ^2^^  ^  ^  ^^^1^2  ^  ^  ^^^2*^1  ^  ^ 


+  X  +  eF(t,X,X,X,x)  =  0, 


(7) 


where 

^  .4  ri • * 

F(t,x,x,x,x,) 


9^f  ,  o  9^"  ^  .  9fi '  , 


■  DC 


.3’f 


3f,;:  .  9f*.v  .  3^'f  '.2 


f  2y2fj^]x  t  ^X  . 


3X  3J;9t 


9X' 


X  + 


+  2 


9?-f 

g'xyj: 


XX  + 


(8) 


Two  initial  conditions  for  (7)  are  already  available  in  (b),  but  two  more  must 
be  adjoined  to  these.  From  (4)  and  its  derivative  with  respect  to  t,  we  find 
that  these  are 
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(9) 


xCt  r  =  p  .  x(t  )  = 

o  '0  0  ^0 


where 


Po  '  “  “1^  "  ’ 

9f(t  ,x  ,v  )  9f(t^,x  ,v  ) 

.Q(tJ  -  -  ay^p^  -  °  °  °  +  V.. -  + 


(10) 


o3x 


af(t  ,x  ,v  ) 

•  0  0  0  T 

+  P^— T -  ]  . 

dt 


(11) 


Motivated  by  the  form  of  solution  of  the  linear  counterpart  (  e=  o)  of  (7), 
we  assume  that  the  solution  of  the  original  differential  equation  (4)  can  be 
expressed  In  the  form 


x(t)  =  a^sinip^  +  a2Sln<(>2i  (12) 

where  a.  =  a.(t)  and  i|».  =  fi.t  +  e.(t),  j  =  1,2  a,(t)  and  e.(t)  being  unknown 

J  J  J  J  J  J  W 

slowly  varying  amplitude  and  phase  functions  with 

n  =  (o)?  -  y.  <  0)4,  j  =  1»2.  (13) 

J  J  J  J  J 

If  we  form  the  first,  second  and  third  derivatives  of  (12)  with  respect  to 
time  and  proceed  In  the  spirit  of  the  method  of  variations  of  parameters,  we 
obtain 

X  =  fi^a^cosi(»^  -  Y^a^slnii)^  +  n2a2^os<ii2  -  Y2a25in<|;2  ,  (14) 

X  =  -2Y^n^a^cosi(»^  -  (fi|  -  yf )  a^siiii|<^  -  2Y2f^2®2^°^'*'2  ' 

-  (n2  -  Y2)a2s1nt2  . 
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X  -  “  fi|)aiCosij/^  +  (3Yin2-Y| ) +  {3Y2O2  ■ 

+  (372^1  -  Y|)a2sirn|)2 

and 

(a^  +  Y-] S'] )sinii)i  +  aie-|Cosi|j-|  +  (a^  +  Y232^^^'^'^2  ^  a2®2^°^'^2  ” 

(S^  +  Yia-i) (f^icosij^l  -  YiSinij)^)  -  a^§i (yiCost|;^  +  fiiSin4)i)  + 

+  (a^  +  7282^  ~  Y2Sini^2^  “  '*'  "" 

(a-j  +  Y-ja-i ) [2YiJ2iCostp^  +  (n|  -  Y^)sin4)^]  -  eie-j [2YifiiSinijJi  - 

-  (f2|  -  Y^)cosi|)i]  +  (82  +  Y2a2)[2Y2J^2  cosi|j2  +  (^2  ~  Y2)sinii)2] 

-  a2e2[2Y2^^2^^'^'*'2  '  ^^2  '  ^  °  * 

The  specific  forms  retained  in  (14)  to  (16)  are  analogous  to 
would  occur  in  the  analysis  of  the  corresponding  linear  problem. 

Substituting  (14)  to  (16)  and  the  apposite  expression  for  x^' 
obtain 

(a-j  +  Y^a^)(eiCOSi|;^  +  f^sinij^^)  +  a^e-j (f^cosijj^  -  e^sini)j-j)  + 

+  (82  +  Y2®2^  ^®2^°^’^2  '''  ~  ' 


20051/^2  + 
(16) 

0  .  (17) 

(18) 

0. 

-  (19) 

those  that 

^  into  (7), 

(20) 

=  -  eF, 


where  F  has  been  defined  in  (8)  and  for  j  =  1,2 


The  system  of  equations  in  (17)  to  (20)  may  be  considered  a  system  of 
linear  algebraic  equations  in  the  four  unknown  a.  +  y-i.  and  e  •,  j  =  1,2.  The 

J  J  J  J 

process  of  solving  this  system  of  equations  leads  to  the  following  system  of 
first  order  differential  equations: 


+  Y^a|  =  eF(R^^cos;^^  + 

=  -  (eF/a^)(R^^sini|j^  -  ,  (22) 

®2  ^^2^2  "  eF(R22COSij;2  ‘ 

02  =*  -  (eF/a2)  (R22sin(i<2  +  R^2*^°^'^2^  ’ 


where 


^22  ~  ~ 

^2  2  2 
R  =  {ij)^  -  w^)  +  4(Yi(i)2  ■*■  >'2*^1 )  “  4yiY2^‘*^]  ■•■  *^2^ 

The  system  of  first  order  differential  equations  in  (21)  to  (24)  must  be 
accompanied  by  a  set  of  four  initial  conditions.  According  to  (6),  (9),  (12) 
and  (14)  to  (16),  we  have 

a^oSin^-10  +  ^2^510^20  =  "o 

s7^a^QCOS){'^Q  -  Y^a-iQsiniii^Q  +  n,2a2gCos\()2Q  “  Y2®20^^'^’*’20  ^  ''o 

o  ,  P  •  o  ,  (26) 

1 Q  +  (^1  ^  ^Y2^2 ^20^^^^20 

+  (^2  ”  ^2 ^^20^^ '^'^20  *  "Fq  ’ 


5'^(3y|  -  s.>j)a^gCosii)^g  +  Y^(3fi|  -  y^ ) a^gSinijj^g  +  n2(3Y|  -  n|)a2Qcosi()2Q  + 

+  y2(^^2  ~  ^ ^20^ ^ ^*^20  ”  ^0  * 
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wliore  a .  “  and  'I'.  «  w.t  +  6.(t  ),  j  =  1,2,  This  is  a  system  of  four 

JOJO  JO  JO  J  O 

.ihjebraic  equations  in  the  four  unknowns  ^jo^’^’^jo*  Solving 

this  system  for  aj(t^j),  0j(t^),  we  find 

1/2, 


a^(to)  =  (W2/n2  +  w2)''VR  , 


-1 


9^(tp)  =  tan  (J1^W2/W-|)  -  aj-jtQ  , 


1/2 


a2(tg)  =  -  (Z2/fi2  +  z2)'/VR  , 


(27) 


02{tQ)  =  tan  "*  “2^( 


v/liere 


+  2y2(2y2  -  3y^)i»)^  +  2y|i*J2  ■•■  ^  2y2)<^2  “ 

“  3yi<o|  +  4yi(y^  -  Y^)]  +  *lo^‘^2  "  ■  "^2^^  * 


W. 


XQa.i2[a)2  -  0)2  +  4y^(y^  -  Y2)]  +  ^''o^^l‘^2  ~  ^2*^1  '''  "  "^2^^ 


+  pQt'-^2  "  “1  ■*■  “  Y5)]  +  2qn(Yi  “  Y?)  » 


O"! 


(2b)) 


Z-j  =  XqO)|[{2yi  -  3y2)“2  ■*■  ^2^^!  ^  ^T2(y2  “  y^)]  +  VQt(D|(<D^  -  o)|)  + 
+2Y|a)|  +  2yi(2yi  “  3y2)‘*>2  +SyiY2(y2  “  Y^)]  +  Po(^'''2  ^  ” 


“  3y9(‘)9  +  4y2(y2  "  y|)']  +  qn(“i  ■  Y^)]  > 


Q  -W2  ^  ^  12 » 2  M 


^2  ^  XQtjj^[o)|  -  0)^  +  4y2(y2  ~  y-])]  +  2Vq[y2<^|  "  y]^2  ^  '*' 


+  -  0)^  +  4(y2  “  Yf)]  +  2qQ(Y2  *  y^) 
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In  the  event  that  yj  *  i.e.,  there  is  no  damping  in  the  system, 

the  forms  of  the  initial  conditions  in  (27)  become  significantly  simpler. 
Using  (6),  (10),  (11)  and  (27),  we  can  verify  that 


ai(to) 


-  [(Vga)|  +  qg)‘‘/(i)|  +  (XqU^  +  -  (lj|l 

1  -  “1^0  . 


(28), 


a2(t^) 


-  [(VqW^  +  qQ)^/(^2  ^  ~  * 

2(to)  “  f  “VgO)^  +  qg  ^  ■  “2^0  • 

In  the  spirit  of  the  Krylov-Bogoliubov  method  [1],  we  form  the  averages  of 
(21)  to  (24)  to  obtain 

a-j  +  Yia-i  =  ^  F’cosij;^  ^  ^12  Fsiniii-i  >]  , 


e-j  =  -(e/ai)[R^^  <  Fsinii;^  >  -  <  Fcosif^i  >]  , 

®2  ^2^2  ^  ^^^22  •^cosif^2  ^  "  *^12  Fsinij;2  >]  , 


(29) 


02  =  (e/a2)[R22  <  Fsinijj2  ^  +  f^]2  ^  Fcosi(j2  >] 


where 


<Fcos4).  >  =  (l/4Tr2) 

J 


fZTT  rZn 


0  -’o 


Fcos<f<jdi|;idij;2  • 


<Fsinif).  >  =  (l/4Tr2) 

J 


2tt 


Ztt 


Fsinif;  .diii.dij; 


(30) 


The  function  F(t,x,x,X,'x)  that  appears  on  the  right  side  of  the  system  of 
differential  equations  in  (21)  to  (24)  was  defined  earlier  in  (8).  The 
quantities  x,x,'x  and  x*  appearing  in  (8)  are  next  replaced  by  the  trigonometric 
expressions  stated  in  (12)  and  (14)  to  (16),  This  implies  that  F  becomes  a 
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function  of  t,  '1'^  and 'I' 2*  Thus,  it  becomes  convenient  to  express  F  as  a 
double  Fourier  series  as  follows: 

F(t,x,x,x,x)  = 

'  Jo  nio  +  F^)(t)s1n(m*,)]x 

^[^mn^^t)cos(ni|;2)  +  F|jjJ^t)sin(nt|;2) ]  ,  .  (31) 

j 

where  the  coefficients  Finn(t),  j  *  1(1)4,  are  known  functions  of  time  t.  In 
expanded  form,  the  first  few  and  most  important  terms  of  (31)  are 

E^sinii<^  +  G^cosi|^2  H^sini/^2  +  •“  »  (32) 

where  the  explicit  forms  of  the  coefficients  F^,  F^,  ...,  are  known. 


If  we  use  (32)  in  (30),  we  find  that 

<  Fcos\|;i  >  =  Fi/2  ,  <  Fsinil^i  >  =  Ei/2 

<  Fcosi^2  ^  ~  ,  <  Fsinip2  >  “  H^/2 


(33) 


Substitution  of  (33)  into  (29)  yields  the  simplified  or  averaged  system  of 
equations 


ai+  ^1^1  ®  (  g/2 )  ( R"!  1  F-|  +  R12E1)  , 

61  —  ~  ( e/Za-j )  ( R-j  1 E 1  “  ^12^1^  * 


32"^  Y2  ®2  ~  (  e/2  )  ( ^22^1  ”^12^1^ 


(34) 


(e/2a2)(R22HT  + 
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From  this  point,  the  objective  is  to  solve  the  system  of  ordiiui ry  di fftjrenti a  I 
equations  in  (34)  subject  to  the  set  of  initial  condit  ions  that:  has  been 
stated  in  (2/). 

linear  differential  equations.  As  a  first  application  of  the  theory 
developed  in  Section  2,  we  consider  the  following  linear  differential  equatit)n 
with  variable  coefficients: 


X  +  5(t)x  +  [a)|  +  n(t)]x  =  cQlt),  tg  <  t  .  (3b) 

The  initial  conditions  are  those  stated  in  (6),  and  Q(t)  is  assumed  to  satisfy 
the  differential  equation  (5)  withY^  =  U*  Comparing  (3b)  with  (4),  we  find 
Yj^  =  0  and 

ef(t,x,x)  =  5(t)x  +  Ti(t)x  .  (3b) 

In  this  case,  (B)  leads  to 


eF(t,x,x,x,x)  =  h^(t)x  +  h2(t)x  +  h2(t)x  +  h^(t)‘x  , 


(37) 


where 


h^(t)  =  n(t)  +  (D|n(t)  ,  h2(t)  =  c(t)  +  a)|?(t)  +  2fi(t)  , 

h2(t)  =  Ti(t)  +  Ziit)  ,  h4(t)  =  c(t)  . 


(38) 


In  the  event  that  y^  =  «  0,  (2b)  reduces  to 

1 


R 


11 


(li ^  ((ji)"j  (JO2 ) 


^12  ~  C  >  ^22 


a)2((u|  - 


Consequently,  the  averaged  differential  equations  in  (34)  become 
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eR^^F^/2  0^  =  -eR^^E^/2a^ 

^2  ~  eR22^]/2  ,  e  =  -eR22Hi/2a2 

where,  in  view  of  (32),  (37)  and  (3B), 

=  (a)^/e)[£(t)  +  ((jj2  -  (D2)5(t)  4-  2n(t)]a^  , 

=  (l/e)[n(t)  +  (o)^  -  (D|)ri(t)  -  2a)||(t)]a^  , 

=  (o)2/£)[i(t)  +  2n(t)]a2 

=  (l/e)[n(t)  -  2a)|5(t)]a2  . 


(39) 


(4U) 


When  (4U)  is  substituted  into  (39),  the  resulting  system  of  first  order 
differential  equations  is  found  to  be 

a, 

^  =  2(uy^' (D^)  2n(t)  +  (u^  -  w|)5(t)]  , 

®1  "  2«^(aip'  ‘  » 

a.  ..  . 

^2  =  2'U^  ^  » 

"1  ■  «♦ 

00  =  oT  /  r - 2T  “  2a)2t(t)]  . 

2  2ai2(aj|  “  a)|;  '■  2 

The  system  of  differential  equations  in  (41)  is  a  linear  system  of  elementary 
uncoupled  differential  equations  with  coefficients  that  vary  with  time  t. 


In  view  of  (6),  (lU),  (11)  and  (36),  the  initial  conditions  associated 
with  the  system  (41)  are  the  expressions  given  in  (28)  with 

Pq  "  ^Q(tQ)  “  * 

(42) 

q^  =  cQit^)  -  XQn(tQ)  -  VQ[n(tg)  +  +  Kt^)]  -  Po?(tp)  . 
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The  differential  equations  in  (41)  are  easily  integrated  to  yield 


rl(t)  +  2r)(t)  -  |(t«)  -  2ri{tf,)i  ,  ft 

a^(t)  =  a^(tQ)exp{ - ^ - C(T)dT}  , 

0  * 

i  L  ■  2c,(4  -  ■  "'‘o’  * 

1  tQ  \  \  d. 

+  +  ^^(tQ)  ,  ' 

c(t)  +  2n(t)  -  |(tp)  -  2n(tp) 

ajd)  =  a2(t(|)axp{ - .  „2-j - )  . 


®2'‘’  '  2(.g(4’-  ■  "'‘o’  *  2“25'‘o”  *  ®2'‘o’  . 


The  quantities  ^jlt^)  and  Oj(tg),  j  =  1»2,  are  to  be  determined  from  (28)^  and 
(42). 

Example  1.  We  consider  the  following  nonhomoyeneous  Mathieu's  equation: 

X  +  (o)^  +•  4bcos2t)x  =  eQpSina)2t,  0  <  t  .  (44) 

A  comparison  of  (3b)  and  (44)  reveals  that 

5(t)  =  0,  n(t)  =  4bcos2t,  t^  =  0,  Q(t)  =  QQSint«)2t.  (45) 

Substituting  (4b)  into  (43),  we  find 

a,tt)  =  a,(0)axp[‘“'f‘^‘,-  ”l  , 

I  I  -  (1)2 
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=  ..I  -“2  "  ^  ■ 


.^,t,  ,  a,(t„)exptMY,.^t),  , 


So(t)  =  -  I  2 — TiTT  sin2t  +  e,(tp)  . 


In  the  present  case,  (42)  reduces  to 


■XQ(a)2  +  4b),  =  ec02QQ  -  VQ(a)2  +  4b)  , 


by  virtue  of  (45).  For  the  initial  conditions  x  =  v  =  0,  we  obtain  p  =  U 

0  0  ^0 


and  Hence,  (2Ba)  leads  to 


a^(0) 


oil  (a)2  “  (tij)  ’ 


S^(0)  =  0  , 


a^(0)  = 


-  (^2 


,(0)  =  0  . 


Consequently,  (46)  can  now  be  expressed  as 


a,  (t)  = 


^“2^0  „..„,4b(cos2t  -  1) 


^1^2  * 


aj|  - 


.  ^  ^0  „.._r4b(l  -  cos2t). 

a«(t;  -  —7* — - 2 — 

2  0)^  -ID^  tiij  -  0)^ 


0«(t)  =  - p-7 - 2ySin2t  . 

2  "  ^1 
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Finally,  substitution  of  (47)  into  (12)  leads  to  the  following  approxim¬ 
ation  for  the  solution  of  (44)  subject  to  the  homogeneous  initial  conditions 
x(o)  =  x(o)  =  0. 


x(t) 


eQ, 


^7! (^2/^1 ) exp [ 


w,  -  w 


bi 


,2  _ 


(1)2  + 


O)-)  (u) 


T' 


4) 

7T — sin2t]  - 


CO  ^7 


(48) 


4b(l  -  cos2t) 


•]sin[co2 


t  + 


4b 


(02(0)^  - 


j7ysin2t]} 


This  approximate  solution  for  (44)  consists  of  the  sum  of  two  sinusiodally 
varying  terms  whose  arguments  consist  of  the  sum  of  a  linear  term  and  a  sine 
term  in  t.  The  magnitude  of  the  phase  angle  becomes  very  large  as  the  value 
of  102  toward  10^.  In  addition,  the  primary  sinusoidal  terms  are  multi¬ 

plied  by  exponential  functions  whose  arguments  depend  upon  cos  2t.  These 
factors  cannot  give  rise  to  an  unlimited  growth  in  time  of  the  amplitude  of 
motion.  Since  the  leading  coefficient  in  (48)  contains  the  term  co^  -  co^  in  the 
denominator,  the  value  of  x  will  become  very  large  whenever  the  value  of  is 
close  to  that  of 

To  assess  the  accuracy  of  the  approximation  for  x(t)  given  in  (48),  we 
have  evaluated  this  approximate  solution  and  have  solved  (44)  subject  to  the 
initial  conditions  x(o)  =  x(o)  =  0  numerically  by  means  of  the  Runge-Kutta 
method  (hereinafter  called  the  "exact"  solution).  The  numerical  results 
obtained  in  this  manner  have  been  plotted  on  the  same  set  of  axes  (x  versus  t) 
for  purposes  of  comparison.  For  the  choice  of  parameters  =  i.s,  =  6,  b 
=  0.05,  Qq  =  1  and  e  =  0,1,  the  exact  and  approximate  curves  for  x(t)  versus  t 

have  been  plotted  in  Figure  1  on  the  interval  0  _<  t  £  10.  In  general,  the 

quantitative  agreement  between  the  approximate  and  exact  curves  is  quite  good 
over  the  entire  interval  shown.  If  the  values  of  all  the  preceding  parameters 
are  held  constant  while  the  value  of  diminished  from  W2  =  6  to  0J2  =  4, 

then  numerical  calculations  yield  the  plot  of  x{t)  versus  t  shown  in  Figure  2. 

Once  again,  good  agreement  between  the  exact  and  approximate  curves  is  very 
good . 
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The  approximation  in  (48)  leads  to  somewhat  less  satisfactory  results  when 
the  frequency  parameters  and  were  assigned  the  values  of  =  1.1  and 
=  3.  This  is  not  surprising  since  the  nature  of  the  solution  of  the  homogen¬ 
eous  form  of  Mathieu's  equation  is  known  to  change  when  the  value  of  is 
close  to  unity--see,  for  example,  Nayfeh  [11],  pp.  234  to  2b4. 

Example  2.  Consider  next  the  non-homogeneous  form  of  Bessel's  equation, 
namely, 

X  +  (l/t)x  +  (cjf  -  vVt^ix  =  eQ(t),  0  <  tQ  <  t,  (49) 

where  Q(t)  is  assumed  to  satisfy  the  differential  equation 

Q  +  =  0.  (493) 

Upon  referring  to  (3b),  we  see  that  for  (49) 

£(t)  =  1/t,  n(t)  =  -  (bU) 


Consequently,  (43)  leads  to 

,1/2 


ai(t) 


0^(t) 


.1  1 


(v^  +  2)cd?  -  (va)„)  -I  -1  .,2  ,  , 

' - 2.^Ui  -  J’t  - 


^^(t)  =  -  p>l. 


(51) 


®2^^^  "  cog(a)^'-  ~  ~  ^2^7  ~  ^  ^2^^0^ 


^  ^  -  i-)l  +  ejt.) 
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where  the  quantities  aj(t^)  and  0j(tQ),  j  =  1.2.  are  to  be  determined  from 

(2B)2. 

In  case  of  »=  0  and  Q(t)  =  Q^^sin  w^t,  we  find  from  (49)  that 

Pq  =  eQgSintD2tQ»  Pg  “  ~  (l/tQ)sin(a)2tg)] . 


Therefore,  leads  to 


a^(tp)  =  [(qg/w^)^  +  pgl^^^/lwl  -  0)^1^ 


01  (tg)  -  tan  [;jjcosIuJt^ 


ajlSin(a)2tQ) 


:o)sin(a)2tg] 


“l^O’ 


^  0^2^  i  n  ( (^^2^0 ) 

02 (tg)  =  tan  [(^,^cos(a)2tg)  -  { l/tg)sin(a)2tgy^  “2^0' 


Therefore,  an  approximation  for  x(t)  is 


x(t)  :  a^(tg)(tQ/t)^/^exp[2]^2 


[(2  +  v2)a)2  -  1  ,} 

2(jJi  (a)^  -  COp )  t  tg  tOild)^  “  u)^)  tg 


J-|  VU)-|  0^2  J 


?  -  ■  1^’  •  * 


+  a2(t„)exp[2(„|  -  |T)lsin{»2t  + 


:|-3  -  {t)  +  <^2^  '  ^  92^^0^^ 
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As  t  becomes  very  large,  the  coefficient  of  the  first  sine  term  in  (53)  tends 
to  zero.  Consequently,  as  t ,  (53)  reduces  to 


x(t) 


a2(tQ)exF 


1  + 


|sin{u2(t  -  tQ) 


+  tan 


032510032^0 


032COS032tQ 


'tQTsino32to 


(b4) 


As  in  the  preceding  example,  numerical  calculations  have  been  performed  to 
solve 

X  +  (l/t)x  +  (03^  -  v2/t2)x  *  eQQSinu32t,  0  <  tQ  <  t, 

subject  to  the  initial  conditions  xlt^)  =  x(t^)  =  0.  Some  typical  plots  are 
shown  in  Figures  3  and  4,  for  which  the  following  numerical  values  for  the 
parameters  were  used;  t^  =  1,  =  1,  =  1  and  e»  0.1.  The  value  =  6, 

v=  1/4  and  ^2  =  4,v  =  i  were  used  in  the  plotting  of  Figures  3  and  4,  respec¬ 
tively.  Figure  3  shows  that  there  is  very  good  agreement  between  the  approx¬ 
imate  and  exact  curves  over  the  entire  time  interval  considered,  namely 
1  _<  t  _<  10.  The  approximation  proves  to  be  slightly  less  precise  in  Figure  4, 
where  has  a  smaller  value.  The  approximate  curve,  nonetheless,  respects 
the  qualitative  features  of  the  exact  curve  reasonably  well. 

Example  3.  The  nonhomogeneous  form  of  Hermite's  differential  equation 
(see  Sneddon  [12],  p.  152)  is 

X  -  2tx  +  2vx  =  £Q(t),  0  <  t, 

where  Q(t)  satisfies  (5)  when  Y2  =  0.  Since  (56)  is  a  particular  case  of 
(3b),  we  have 

03^  =  2v,  ^(t)  =  "2t,  n(t)  =  0,  ^0  ” 
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When  the  expressions  in  (b?)  are  inserted  into  (43),  the  results  are  found  to 
be 


^2 /p  .  ,  2^1 1 

a,(t)  =  a,(0)e  ,  e,(t)  =  e,(0) - -r - ^2  . 

I  I  I  I  toj  - 


a^lt)  =  a^CO)  , 


2co«t 

e;,{t)  =  9„(0)  -  2--  2  . 

2  2  (1)2  ~  ^2 


(b8) 


where  for  x  =  v  =  u  and,Q(t)  =  Q  sin 

0  0  0  d 


a-j  (0) 


etii^Qo 


to^  ]  ti)|  -  a)|| 


32(0) 


-eQ 


9^(0)  =  62(0)  =  0  . 


(b9) 


When  (b8)  and  (59)  are  substituted  into  (12),  the  result  is  the  following 
approximation  for  x(t): 


x(t) 


f^/9 

- {(aj„/a),)e  sin[a),t(l 

-  “fl 


w ^  “  ^2 


)]  - 


(60) 


Numerical  results  based  upon  the  approximate  solution  (60)  and  the  "exact" 
Runge-Kutta  solution  have  been  plotted  on  the  interval  0  _<  t  £  1.5  in  Figures 
5  and  6,  where  the  set  of  parameters  1^2  -  6,  v=  3  and  (^2  “  respec¬ 
tively,  have  been  used  with  =  1,  0.1.  The  agreement  between  the  approx¬ 

imate  and  the  "exact"  solutions  is  generally  rather  good  for  the  time  interval 
considered. 


164 


4.  A  NONLINEAR  DIFFERENTIAL  EQUATION.  In  the  preceding  section,  we 
showed  that  the  general  theory  developed  in  Section  2  can  be  applied  effect¬ 
ively  to  several  nonhomogeneous  linear  differential  equations  with  variable 
coefficients  for  the  purpose  of  generating  analytical  expressions  that  repre¬ 
sent  approximate  solutions.  It  is  next  of  interest  to  examine  a  forced  motion 
problem  in  the  theory  of  nonlinear  oscillations.  In  particular,  we  consider  a 
form  of  Duffing's  equation  that  includes  the  effect  of  linear  damping,  i.e., 

X  +  ZyX  +  +  eax'^  =  £Q(t)  ,  0  <  t  ,  (61) 

where  Q(t)  satisfies  (b).  Comparing  (4)  and  (61),  we  write 


•  3 

Y-]  =  Yi  (*>1  *  HD*  f(t,x,x)  =  ax  ,  (62) 

where  a=  l  for  a  hard  spring  and  a=  -1  for  a  soft  spring.  Using  (62)  and  (8), 
we  can  easily  show  that 


F  =  .  2.  F  |I  X . 


03  0,  9  p 

~  aoj^x'^  +  6aY2X  X  +  3ax  x  +  6axx  ' 


(63) 


If  we  now  use  (12),  (14)  and  (lb)  with  (63),  we  can  verify  that 

3  2  2 

aF  =  A^sin  ij/-!  +  A2Sin  +  A^sinijj^sin  \p^  + 

3  2  2 

+  A^sin  +  AgSin  +  AgSin  i|)^costjj2  + 

+  AySini(;^cosil/iSin4j2  +  AgSin\(j^sini)j2C0S\j;2  + 

2  2  2 
+  AgCosil^iSin  +  A^gSin  :p2C0S\p2  +  A^iSint^-jCos  \|)i  + 

2  2 
+  A.|2siniij^cos  ^p^  +  A^gSiniiJ^cos<Jj-|Cosil)2  +  A^^cos  i/)^sinijj2  + 

2 

+  AigSini|)2C0S  ip^  +  AigCOS4JiSinij)2COSili2  »  (64) 
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where 


=  ((0^  -  3(0^  +  12Yf  -  6YiY2)af.  ^2  " 

=  3(2y|  +  2YjY2  +  2y2  -  “  (»)|)a^a|,  A^  =  2(3y2  "  ti)|)a^, 

Ag  =  6Qi(y2  "  3Yi)a|,  Ag  “  -12Yin2a|a2  , 

Ay  =  -24yiJ^i 31^32’  ^8  ~  Y2)^2^1^2’ 

Ag  =  -6n^(Yi  +  Y2)aiaf^  A^O  "  -'^^y2^2^2* 

A^2  =  6fi|a^a|,  A^2  =  12!:2jfi2®f®2’  ^14  '  ®^f®i®2’ 

Ai g  “  6fi2a2»  ^16  ~  l2f2iJ^2a'j ^2  * 

If  (64)  is  used  with  (30)  and  (33),  we  find  after  some  manipulation  that 

<  Fcostfi-i  >  =  (a/8)(Ag  +  2Ag), 

<  Fsin(|j^  >  =  (a/8)(3Ai  +  2A2  +  A-ii  +  2Ai2)»  (66) 

<  Fcosii^2  ^  “  (0/8)  (2Ag  +  A^iq), 

<  Fsin\Jj2  >  =  (a/8)(2A2  +  3A^  +  2A14  +  A-jg), 


166 


where  it  may  be  observed  that  the  coefficients  Ag,  A^^g  and  A^g  defined  in 
(6b)  do  not  appear.  In  view  of  (66)  and  the  riyht  side  of  (29),  it  can  be 
shown  that 

R.|1  <  Fcosiii-i  >  +  Ri2  <  Fsin\);i>  *  H-jal  +  , 

R-jl  <  FsinTj;^  >  -  Ri2  <  Fcosi^-j  >  ®  Hga|  +  H^a^a|  , 

(67) 

^22  ^  ^  “  *^12  Fsinii/2  >  =  -Hga|a2  -  Hga|  , 

R22  <  Fsin\p2  ^  ^12  Fcosti)2  >  =  Hya|-a2-^+  Hga|  , 

where 

=  (3aYi/2R)  [a)|  -  u)|  -  2(y2  "  Yi)^]»-  H2  =  (3aY2/R) 

Hg  =  -(3a/8fiiR){(«o|  -  a)|)2  +  4yi(3yi  -  -  u^)  + 

+  8y|(y2  ‘Yi)^  +  4w|(y2  ‘  ’''l^^'''2  " 

=  -( 3a/4ni R)  [ (^2  “  w^)^  +  4y^(u)|  -  to^)  -  4()0^(y2  ~  Y^)]j  (68) 

Hg  =  (3aYi/R)  [(*J2  “  “I  "  2(y2  "  Y-])^'].  Hg  =  (3aY2/2R)  t(u|  -  a)|). 


Hy  =  (3aYi/f22R)  [(3yi  -  -  Y-j^l  +2y2(y2  "  Yi)^]> 

Hg  =  (3aY2/2J^2*^^  "  ''’2^“2  "  ’>'2“!^* 

As  a  consequence  of  (67),  (29)  can  now  be  expressed  as 

^  ~  e(Hia^  +  H2aia|), 

6^  =  -^(Hgal  +  H^a^), 

®2  ^2^2  ~  -2e(Hia|a2  +  H2e|), 

©2  =  -e(Hya|  +  Hga|) . 


(69) 
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Example  4.  Suppose  that  y,^  =  0,  so  that  Q(t)  is  a  solution  of  (49a) 
now  follows  from  (68)  that 

=  (3aYi/2R)  (wl  -  -  2y^)  j  ^2  ~ 


=  -Oa/Sn-jR)  [(a)|  -  (o|)^  +  +  8y^], 


(70) 


where,  in  view  of  (2b), 


da 


=  (3aY|/(»)2R)  (3^2 

-  <«)|) , 

00 

II 

0 

CSJCVJ 

3 

-  a)|)2  +  (2yi(ii)2)^« 

(71) 

equations  of  motion  (69) 

reduce 

to 

Y,a, 

=  eH^a3, 

(72) 

=  -e(H2a|  +  H^a^) , 

(73) 

=  -2EH^a^a2, 

(74) 

h 

=  -eHya^. 

(7b) 

m 

is  uncoupled  but  nonlinear. 

,  It  can  be 

Indeed,  we  can  write 

‘1 

«  eH^dt, 

(76) 

0  and 

(jo^  >  tUj  +  2y2»  so  that 

>  0  and 

=  Yi/eH-i  >  0. 

(77) 

it 
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The  integral  of  (76)  is  easily  verified  to  be 

ce~^l^ 

'  TTTTW^ 


{7«) 


where 


r  =  [c/a^(0)]^  -  1 


(79) 


Since  the  explicit  form  of  aj^(t)  is  now  available  in  (78),  we  can 
determine  a2(t)  through  an  integration  of  (74).  The  result  is 

iy{t)  =  cf(r  +  e“^^l^) , 


(8U) 


where 


a  =  a2(0)[a^(0)/c]‘ 


(81) 


Finally,  using  the  expressions  for  aj^(t)  and  a2(t)  in  (78)  and  (80),  we  derive 
the  following  phase  angle  expressions  from  (73)  and  (7b); 


r  +  e"^^l^ 


0^(t)  =  (H3/2H^)£n[  .  ^  ^  ^ -  -^[(2r  +  1)^  +  Y-, 


t(2r)- 


-  (2r  +  e‘^^1^)^]  +  0^(0) 


(82) 


and 


r  +  e‘2^1^ 


02(t)  =  (H^/2H^)tn[--y  ■{ - 1  +  02^°^ 


(83) 
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Expressions  for  the  initial  values  aj(0)  and  ej(0),  j  =  1, 
determined.  When  yg  =  0»  (28)  yields 

^1  -  Yi(XqO)2  +  Pq)(o)2  “  3a)|  +  4y|)  +  (Vq^^  +  cig)(t«)2  ■  + 

W2  =  (XqOiI  ^  Pg)((i)|  -  0)2  +  4Yf)  +  2Y^(Vga)2  +  q^), 

^1  ”  2y^w|(Xq(i)|  +  Pq)  +  Vg[a)|(o)|  -  a)|)  +  (2y^W2)2]  +  qg(a)| 

^2  ~  XqO)|(o)^  -  (1)2)  -  Pq(w^  “  o)|  +  4y|)  -  2Yi(Vga)|  +  Pq)  . 


Furthermore,  from  (10)  and  (11),  we  find 

Pq  =  -  “^0  ■  ^YiVg  -  eax3, 

qg  =  eQ(0)  -o)fVg  -  2YiPg  -  3EaVQx2^ 

since  f(t,x,x’)  =  ax^.  In  the  case  of  Q{t)  =  sin{oj2t)  and  x^ 
have  from  (86) 

Pq  =  0.  q,,  =  • 


whereas  (84)  yields 

^1  ~  ^  2y^)  >  ^2  ~  ^^^l'*^2^0  ’ 

Z-j  =  qg(f‘3^  -  top,  Z2  =  "2Yiqg  . 


Consequently,  (27)  leads  to  the  expressions 
apO)  =  (ea)2Qg/Jli R)  [ (a)|  -  a)|)2  + 

32(0)  =  -eQg/R^^^, 

«1«”  =  6^(0) 


+  topi 


1/2 


9 


=  tan 


r  ^"'“2  1 


,  must  now  be 

2Yf). 

(84) 

“  (Op, 

(85) 

=  V  =0,  we 

(86) 


(87) 
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where  R  was  given  earlier  in  (71). 


Substituting  (78)  and  (80)  into  (12),  we  find 
ce”^1^ 

x(t)  =  - +  e,(t)]  + 

(r  +  '  ' 


-2y  t 

+  32(0)-^ — (r  +  e  )sin  +  e2(t)J, 


(88) 


where  9  (t),  j  =  1,2,  are  defined  in  (82)  and  (83),  with  the  initial  values 

J 

being  given  in  (87). 

It  is  easy  to  see  that  the  function  aj^(t)  in  (78)  approaches  zero  as  t 
tends  to  infinity,  whereas  according  to  (80)  and  (83) 

a2(t)  ~  a2(0)[l  -  af(0)/c2)  , 

02(t)  (Hy2H^)£n(-p:^)  +  tan 

as  t*"^“  .  Consequently,  in  the  steady  state,  we  have 

x(t)  ~  32(0)  [1  -  a|(0)/c2]sin{<i)2t  +  (H^/2H^ )  Jin(-p^)  + 

_1  27-10)0  (89) 

+  t  an  [—2 - 7]  ^  • 

ii>2  ~ 

In  the  case  of  «=  1,  o)^  =  1,  o)^  =  6,  =  1  and  e=  0.1,  plots  of  the 

variation  of  x(t)  with  t  as  determined  from  (88)  are  shown  in  Figures  7  and  8 
for  =  0.25  and  Y^  =  0.75,  respectively.  The  exact  and  approximate  curves 
in  these  figures  are  in  excellent  agreement  on  the  time  intervals  considered. 
In  Figure  8,  the  higher  coefficient  of  damping  (y^  =  0.7b)  causes  a  rapid 
decay  of  the  transient  component  of  the  motion,  and  the  steady  state  behavior 
as  described  by  (89)  becomes  evident  after  the  first  two  cycles. 
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If  the  nonlinearity  of  the  system  is  characterized  by  a  soft  spring 
{a=  -1),  the  form  of  the  solution  just  presented  must  be  modified  slightly. 
We  write 


B  = 


=  (3Y^/2R)(a)|  -  a)|  -  2y|) 


H3  =  (3/8n^R)[(u)|  -  0)2)2  +  +  By*}] 


(9U) 


=  -3/4fi^  ,  H7  «  -(3Yf/w2R)(3a)|  -  o)2)  . 

Equations  (73)  and  (7b)  remain  unchanged,  but  it  is  advantageous  to  replace 
(72)  and  (74)  with 


a]  +  Y^a^ 


-cBa|  , 


(yi) 


and 


^2^  ~  2£Ba^a2  • 


(92) 


respectively. 


Integration  of  (91)  leads  to 


ve 


-Yit 


(93) 


where 


•  t,/c6  "  2R/3e(u2  -  •  1  +  [v/a,tO)]2.  (94) 


The  integrations  of  (92),  (73)  and  (7b)  are  elementary.  The  results  are 
a,(t)  «  a5(0)a|(0)(M2  -  e"^Yl^)/v2  , 


^2y  t 

e^(t)  =  e^(0)  -  (e/4Y^)(2H3v2jin[-^-|^^-y '*  + 


(95) 
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+  H^[a|(o)a2(0)/v2][4Y^M''t  +  Cl  -  - 


-  (2M2  -  e‘'^^r)2}  , 


M2  - 


e2(t)  =  62(0)  -  (EH^v2/2Y^)£n[-  %|2  :  T  -■]  , 


where  the  values  of  3^.(0)  and  ®j(0),  j  =  1.2.  can  be  determinted  from  (87). 

Therefore,  by  virtue  of  (12).  (93)  and  (9b)  to  (97).  an  approximate  sol¬ 
ution  of 


X  +  Zy-jX  +  a>2x  -  ex^ 


=  eCgSina)2t 


x(0)  =  x(0)  =  0  , 


a|(0) 


+  a2(0)-^(M2  -  e"^^l^)sin[a)2t  +  e2(t)]  . 


the  solution  in  (99)  tends  to  the  steady  state  solution 


x(t)  -  a2(0)(l  +  c)sin(u)2t  +  t|;2)  , 


(lUO) 


where 


-1 

?  =  a2(0)/v2  ,  ip  =  tan  (-^-~^)  -  (eH7v2/2Yi )tn(l  +  c)  ■ 

>  C,  U)^  ~  1  '  * 
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To  assess  the  quality  of  the  approximation  in  (99),  we  have  also  solved 
numerically  the  initial  value  problem  stated  in  (98).  These  "exact"  and 
approximate  solutions  have  been  plotted  in  Figures  9  and  10  for  =  1,  « 

4,  Qp  =  1,  and  e*  0.1,  with  =  0.25  and  =  0.75,  respectively.  In  both 
cases,  the  exact  and  approximate  curves  for  x(t)  versus  t  are  in  excellent 
agreement.  The  decay  of  the  transient  portion  of  the  solution  is  quite  evi¬ 
dent.  Moreover,  the  attainment  of  the  steady  state  motion  is  clearly  visible 
in  Figure  10  since  the  value  of  the  damping  coefficient  is  three  times  greater 
than  the  value  used  in  the  plotting  of  Figure  9. 

SUMMARY  AND  CONCLUSIONS.  A  method  that  can  be  applied  for  the 
determination  of  approximate  analytical  solutions  of  the  class  of  nonhomogen- 
eous  linear  and  nonlinear  differential  equations  of  second  order  stated  in  (4) 
has  been  developed,  it  is  hypothesized  that  the  forcing  function  Q(t)  in  (4) 
satifies  the  linear  second  order  differential  equation  with  constant  coeffici¬ 
ents  shown  in  (5).  The  class  of  physical  problems  considered  is  character¬ 
istic  of  the  linear  and  nonlinear  theories  of  oscillatory  systems.  The  pre¬ 
sent  method  represents  an  extension  of  Burnelle's  method  of  averaging  [3], 
which  specifically  accounts  for  the  presence  of  linear  but  subcritical  damping 
in  the  system.  Burnelle's  technique,  which  is  applicable  to  homogeneous 
differential  equations,  may  be  viewed  as  a  generalization  of  the  Krylov- 
Bogoliubov  method  of  averaging  [1],  in  which  only  small  linear  damping  is 
considered.  The  linear  damping  term  is  understood  to  be  a  component  of 
f(t,x,x). 

In  the  technique  presented  here,  x(t)  is  assumed  to  possess  a  solution  in 
the  form  shown  in  (12),  where  the  amplitudes  a.(t)  and  the  phase  of  angles 

J 

®j(t),  j  =  1,2,  are  to  be  determined.  The  method  of  variation  of  parameters 
is  applied,  and  a  system  of  first  order  differential  equations  in  the  a.'s  and 

J 

e.'s  is  derived.  To  simplify  this  system  of  differential  equations,  the 

J 

averaging  operators  in  (30)  are  applied  to  (21)  to  (24).  It  may  be  argued 
that  the  right  sides  of  these  differential  equations  are  slowly  varying  func¬ 
tions  of  time  t.  This  process  reduces  the  complicated  system  to  a  much 
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simpler  system  that  is  in  many  cases  tractable  by  elementary  methods.  In 
essence,  this  has  been  accomplished  by  expressing  the  right  sides  of  (21)  to 

(24)  as  double  Fourier  series  in  and  and  then  approximating  the  resulting 

expressions  through  retention  of  only  the  constant  term  (i.e.,  the  average 
value  of  the  right  side)  of  the  Fourier  series.  It  is  this  averaging  process 
that  introduces  the  approximation  nature  into  the  technique  of  solution. 

The  averaged  differential  equations  in  (34)  were  then  solved  for  some 
specific  choices  of  the  function  f  (t,x,x)  that  appears  in  (4).  Firstly,  a 
class  of  linear  differential  equations  was  considered  for  the  form  of  f(t,x,x) 
shown  in  (36).  The  integrated  forms  of  a-(t)  and  0.(t)  are  presented  in  (43). 

J  J 

As  illustrative  examples,  nonhomogeneous  forms  of  the  differential  equations 
of  Hermite,  Mathieu  and  Bessel  have  been  solved  approximately  by  the  proposed 
method.  To  assess  the  validity  of  the  analytical  approximations,  these  same 

initial  value  problems  were  solved  numerically  for  several  values  of  the 

parameters  by  the  Runge-Kutta  method,  herein  called  the  "exact"  method.  Plots 
of  x(t)  versus  t  based  upon  both  the  analytical  approximations  and  the  numer¬ 
ically  exact  computations  were  prepared  on  the  same  sets  of  axes.  These 
numerical  results,  shown  in  Figures  1  to  6,  proved  to  be  in  excellent  qual¬ 
itative  and  relatively  good  quantitative  agreement. 

Secondly,  Duffing's  nonlinear  differential  equation  including  the  effect 
of  subcritical  linear  damping  has  also  been  solved  in  an  approximate  sense  by 
the  proposed  method.  Both  hard  and  soft  nonlinear  springs  have  been  consid- 
ered,  i.e.,  f(t,x,x)  =  j^x  .  Numerical  calculations  revealed  that  the  analy¬ 
tical  approximation  provides  a  very  accurate  representation  of  the  solution  of 
the  nonlinear  differential  equation  even  into  the  steady  state  domain.  The 
exact  and  approximate  curves  have  been  plotted  in  Figures  7  to  10.  The  human 
eye  cannot  distinguish  between  them  on  the  scale  of  the  axes  employed. 

Consequently,  the  method  developed  here  offers  a  practical  technique  for 
the  determination  of  rather  accurate  analytical  approximate  solutions  for 
certain  types  of  nonhomogeneous  linear  ordinary  differential  equations.  It 
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Thi  s 


should  be  noted,  however,  that  the  solutions  are  not  valid  whenw^-jWj^. 
case  requires  a  modification  of  the  form  of  the  solution  assumed  here  for 
x(t).  The  resolution  of  this  situation  remains  a  problem  for  further  research. 
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Figure  6  .  Variation  of  x<t)  versus  t  for  CJ»  =  10,  =  6, 

Qg  =  1,  and  6  =  0.  i  according  to  (  60  ). 
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Figure  .  Variation  of  x(t)  versus  t  according  to  (  88  .) 
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Figure  ^£> .  Variation  of  x<t)  versus  t  according  to  (  99  ) 
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ABSTRACT ■  In  a  previous  paper,  a  method  for  computing  the  stresses  due 
to  the  combined  effects  of  transient  temperatures  and  material  phase 
transformations  was  described*  The  general  purpose  finite  element  program 
ADINAT/ADINA  was  used  for  the  computation  of  both  the  transient  temperatures 
and  the  associated  stresses.  The  problem  considered  was  that  of  an 
axisymmetric  hollow  cylinder  undergoing  a  water-spray  quench.  The  present 
work  considers  a  similar  model,  but  is  better  able  to  describe  the  residual 
stress  state  because  of  the  availability  of  a  more  accurate  set  of  properties 
for  the  material  expansion  due  to  the  phase  transformation.  Effects  on  the 
transient  and  residual  stresses  due  to  modifications  of  the  material  expansion 
and  varying  quench  rates  are  discussed - 

It  is  found  that  the  stresses  due  to  the  transformation  are  more  severe 
than  those  due  to  the  transient  temperatures  alone*  Inelastic  behavior  is 
found  to  occur  in  all  the  cases  considered  and  high  residual  stresses  can 
exist  on  the  inner  and  outer  surfaces*  I^ile  dependent  on  actual  material 
composition,  these  residual  stresses  can  lead  to  quench  cracking. 

The  model  describes  the  rapid  quenching  of  steel  gun  tubes  for  the 
purpose  of  developing  a  martensitic  grain  structure  and  desired  physical 
properties  in  the  tube. 

I*  INTRODUCTION.  Rapid  quenching  of  components  from  initially  high 
temperatures  usually  results  in  development  of  residual  stresses  within  the 
components  upon  cooling*  The  quenching  is  normally  undertaken  to  develop  a 
desired  microstructure  in  the  material  which  would  determine  its  behavior  or 
response  in  use.  These  changes  in  microstructure,  or  transformations,  cause 
volume  changes  which  can  give  rise  to  stresses  in  the  component  in  addition  to 
the  stresses  due  to  the  rapid  temperature  changes.  Thus,  the  residual  state 
of  stress  which  exists  when  the  component  is  cooled  to  room  temperature  is  due 
to  the  combined  effect  of  transient  temperatures  and  material  phase 
t rans format ion - 

In  a  previous  paper  [1],  a  method  for  computing  the  stresses  due  to  these 
combined  effects  was  described.  The  general  purpose  finite  element  program 
ADINAT/ADINA  was  used  for  the  computation  of  both  the  transient  temperatures 
and  the  associated  stresses.  The  problem  considered  was  that  of  an 
axisymmetric  hollow  cylinder  undergoing  a  water-spray  quench.  The  present 
work  considers  a  similar  model  but  is  better  able  to  describe  the  residual 
stress  state  because  of  the  availability  of  a  more  accurate  set  of  properties 
for  the  material  expansion  due  to  the  phase  transformation.  The  material 
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properties  and  system  parameters  used  in  the  computations,  such  as  quenching 
time,  were  chosen  using  the  rotary  forge  quench  facility  at  Watervllet  Arsenal 
as  a  model* 

The  quench  facility  has  nozzles  on  several  diametral  planes  for  spraying 
water  on  the  outer  diameter  of  a  long  tube  as  it  is  slowly  rotated-  The  bore, 
or  inner  surface  of  the  tube,  can  also  be  cooled  by  a  bore  flush-  There  have 
been  several  types  of  quench  cycles  in  the  past.  These  include  varying  quench 
times  of  the  outer  diameter  for  the  breech  and  muzzle  ends  of  the  tube.  The 
long  tubes  are  usually  of  constant  bore  diameter  and  varying  outer  diameter 
with  the  larger  being  the  breech  end  and  the  smaller  the  muzzle  end- 
Quenching  of  the  bore  can  be  omitted,  delayed,  simultaneous  with  the  outer 
diameter  quench,  etc-  The  goal  was  to  develop  the  design  properties  in  the 
tube  without  causing  quench  cracking  due  to  the  high  residual  tensile  stresses 
at  the  bore* 

Since  the  diameter  of  these  tubes  varies  slowly,  end  effects  are  ignored 
and  the  tube  is  treated  as  a  long  axisymmetrlc  cylinder-  In  the  earlier  work 
[1],  the  transient  temperatures  and  combined  stresses  for  the  breech  and 
muzzle  ends  were  treated  as  two  separate  cases.  This  earlier  work  also 
considered  the  effects  of  the  different  quench  cycles.  The  geometry  used  is 
that  of  the  muzzle  or  smaller  end  of  the  long  tube*  It  is  also  assumed  that 
no  bore  quench  takes  place.  This  was  done  because  most  of  the  quenching 
currently  being  undertaken  at  the  facility  does  not  use  the  bore  quench* 

Latent  heat  Is  ignored  in  the  computation  of  the  transient  temperatures-  This 
is  not  due  to  a  limitation  of  the  model,  but  to  a  lack  of  appropriate  input - 

Based  on  recent  experimental  work  [2],  the  residual  stresses  can  now 
be  computed  from  realistic  temperature-transformation  curves.  Also,  because 
of  Information  on  the  martensite  transformation  itself,  the  effect  of  new 
quench  cycles  on  the  cooling  curves  can  be  seen.  The  resultant  residual 
stress  distributions  can  then  be  discussed- 

II-  PROBLEM  STATEMENT-  Thermal  and  transformation  stresses  are  computed 
for  long  hollow  cylinders  as  they  are  being  quenched.  The  effects  due  to  the 
transient  temperature  distributions  and  the  martensite  transformation  are  both 
considered  in  the  stress  calculations*  The  thermo-physical  properties  are 
assumed  to  be  temperature  dependent-  The  residual  stress  distributions  at  the 
end  of  the  quench  cycles  are  presented-  Both  experimentally  developed  and 
assumed  transformation-temperature  curves  are  used,  and  the  quench  cycle  is 
varied.  A  general  purpose  finite  element  program  ADINAT/ADINA  is  used  for  the 
computations* 

III.  FINITE  ELEMENT  PROGRAM*  The  finite  element  geometry  for  the 
problem  is  shown  in  Figure  1,  along  with  a  simplified  drawing  of  a  gun  tube, 
Eight  node  quadrilateral  elements  are  used  in  the  model-  The  present  work 
shows  results  only  for  the  muzzle  end  of  the  tube*  In  the  earlier  work  [1], 
stress  and  temperature  results  for  the  breech  end  of  the  tube  due  to  different 
quench  cycles  and  an  assumed  transformation-temperature  curve  were  presented* 

The  finite  element  program  actually  consists  of  two  parts,  one  for 
computing  temperatures,  ADINAT,  and  one  for  computing  stresses,  ADINA*  Each 
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program  can  stand  alone,  but  when  one  wishes  to  compute  thermal  stresses  using 
the  same  geometry,  ADINAT  produces  a  file  which  includes  the  temperatures  at 
the  node  for  each  time  step  if  the  problem  is  a  transient  one.  This  ADINAT 
output  file  can  then  be  used  as  input  to  ADINA  for  the  stress  computation. 

In  the  program,  the  thermo-physical  properties  were  considered  as 
functions  of  temperature.  The  convection  losses  during  the  heat  transfer 
portion  of  the  computation  are  considered  to  be  due  to  the  temperature 
difference  between  the  tube  wall  and  ambient,  which  is  assumed  to  be  18. 3® C 
(65®F).  For  the  computation  of  stresses,  one  has  a  choice  of  several  material 
behavior  models  in  ADINA.  The  one  chosen  for  this  work  was  Model  10  [3]  which 
is  applicable  to  the  thermo-elastic-plastic  solution  of  Interest.  The  yield 
criterion  assumed  was  the  distortion  energy  criterion  and  the  yield  stress  was 
assumed  to  be  a  function  of  temperature*  No  creep  or  hardening  was  assumed 
although  the  model  allowed  both  to  be  incorporated. 

To  compute  thermal  stresses,  the  problem  for  the  transient  temperatures 
was  solved  using  just  ADINAT  as  indicated  above.  The  special  file  created  by 
ADINAT  was  then  used  as  input  to  ADINA  to  compute  the  thermal  stresses. 
However,  in  many  cases  in  solving  the  temperature  problem,  time  increments 
vary.  Short-time  increments  are  used  during  periods  of  large  transients, 
and  longer-time  increments  when  the  temperature  gradients  are  not  as  severe. 
While  ADINAT  allows  one  to  change  time  increments,  ADINA  does  not.  This 
difficulty  was  overcome  by  manipulating  the  temperature  file  used  for  stress 
computation  so  that  with  the  restart  capability,  ADINA  would  see  only  one  time 
increment  during  any  one  computation  interval.  Finally,  the  restart  facility 
in  ADINA  [4]  was  altered  so  that  a  restart  could  be  undertaken  from  any 
previous  time  instead  of  just  the  last  completed  step. 

The  computation  of  transformation  stresses  and  combined  thermal  and 
transformation  stresses  can  be  treated  like  thermal  stresses  with  little 
additional  effort-  The  effect  of  the  transformation,  at  least  the  aspect  of 
it  giving  rise  to  stresses,  is  to  create  a  volume  change  in  the  material.  In 
this  case  the  volume  change  is  an  increase,  and  it  occurs  when  the  temperature 
at  a  point  in  the  material  becomes  equal  to  the  martensite  start  (Mg) 
temperature  and  is  completed  when  it  reaches  the  martensite  finish  (Mp) 
temperature-  If  the  transformation  is  assumed  to  be  Isotropic,  then  the 
linear  expansion  can  be  taken  as  one-third  the  volume  change.  Reference  1 
describes  what  was  done  when  the  expansion  due  to  the  volume  change  was 
available  as  a  separate  quantity  from  the  thermal  expansion  coefficient  of  the 
material.  The  current  work  takes  advantage  of  the  fact  that  the  expansion  is 
available  from  experiments  as  the  combined  effect. 

IV.  EXPERIMENTAL  WORK.  Cote  [2]  is  conducting  tests  on  various  steels 
used  in  the  manufacture  of  large  caliber  cannon.  Small  samples  of  the  steels 
are  quenched  at  different  rates  and  the  expansion  in  each  sample  is 
determined.  He  supplied  the  data  for  the  experimental  curve  for  the  linear 
expansion  versus  temperature  curve  shown  in  Figure  2.  He  also  suggested  the 
modified  curve  as  being  typical  of  another  type  of  steel  or  one  in  which  some 
balnite  has  formed  in  addition  to  martensite.  Other  aspects  of  his  work 
indicate  that  the  desired  microstructure  can  be  achieved  with  slower  quench 
cycles.  This  work  has  guided  some  of  the  decisions  as  to  what  quench  cycles 
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to  run,  what  transformation  curves  to  use,  etc. 


V.  RESULTS.  The  early  work  [1]  was  based  on  the  assumed  transformation- 
temperature  curve  shown  in  Figure  2*  The  experimental  curve  was  not  available 
at  that  time.  Based  on  that  curve,  however,  several  different  quench  cycles 
were  run  to  determine  the  transient  temperature  distributions  and  the 
associated  residual  stresses*  The  quench  cycles  previously  run  were; 

1*  Bore  quench  started  at  the  same  time  as  the  OD  quench. 

2.  Bore  quench  started  30  seconds  before  the  OD  quench  began. 

3-  Bore  quench  started  30  seconds  after  the  OD  quench  began. 

4.  No  bore  quench* 

These  were  run  for  both  the  muzzle  and  the  breech  end  of  the  gun  tube*  For 
comparison  purposes,  some  of  this  earlier  work  is  shown  in  Figures  3  through 
5*  The  results  are  for  the  muzzle  end  with  no  bore  quench  taking  place* 

Figure  3  shows  the  variation  with  time  of  the  tangential  stress  at  the  bore 
and  in  the  outer  surface.  Since  this  was  a  four  minute  quench  simulation,  the 
stresses  at  time  250  seconds  are  the  residual  stresses  at  those  respective 
points*  The  breaks  in  the  curves  indicate  when  the  phase  transformation  would 
begin  and  end  in  the  bore  and  on  the  outer  surface.  The  onset  of  inelastic 
deformation  can  also  be  found  as  can  changes  in  the  slope  of  the  phase 
transformation-temperature  curve*  Figure  4  shows  the  residual  stress 
distribution  throughout  the  cylinder  wall*  High  compressive  stresses  are  seen 
at  the  bore  and  high  tensile  stresses  on  the  outer  surface.  Figure  5  is  a 

series  of  figures  showing  the  tangential  stress  distribution  across  the  wall 

thickness  at  different  times.  The  transformation  has  started  on  the  outer 
diameter  by  150.5  seconds  and  continues  into  the  cylinder.  It  begins  on  the 
ID  by  175*5  seconds  and  the  residual  stress  state  (similar  to  Figure  4)  is 
seen  at  time  245.5  seconds. 

The  results  for  other  transformation  curves  in  Figure  2  are  shown  in  the 
remaining  figures.  The  phase  transformation  curves  used  here  for  the 
computation  of  residual  stresses  are  the  experimentally  determined  one  and  the 
one  modified  from  it*  In  addition,  each  was  run  with  a  fast  quench  4 

minutes)  and  a  slow  quench  (~  15  minute)  cycle.  Boundary  conditions  for  the 

computation  of  temperatures  are  not  readily  available.  The  only  known 
information  is  that  the  temperature  on  the  muzzle  end  of  the  tube  reaches 
about  200® F  in  approximately  four  minutes-  This  point  is  about  eight  minutes 
on  the  breech  end.  A  constant  convection  coefficient,  h,  is  found  by 
exercising  the  program  which  gives  us  this  point  and  is  used  in  subsequent 
runs*  The  geometry  for  the  muzzle  end  was  used  and  no  bore  quench  was 
considered*  Figures  6  through  9  represent  the  transient  temperatures 
experienced  by  the  tube  during  quenching.  Figure  6  is  the  response  of  the 
fast  quench  and  the  temperature  distribution  throughout  the  wall  at  selected 
times  is  shown.  The  effect  on  the  temperature  distributed  due  to  the  OD  only 
quench  Is  easily  seen,  especially  at  early  times*  The  difference  between  the 
outer  diameter  and  bore  diameter  temperatures  at  any  time  is  found  in  Figure 
7,  again  for  the  fast  quench  cycle.  Figures  8  and  9  show  the  thermal  results 
for  the  slow  quench  cycle,  with  Figure  8  showing  the  radial  distribution  of 
temperatures,  and  Figure  9  the  OD  and  ID  temperature  as  they  vary  in  time*  It 
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is  easily  seea  that  the  temperature  difference  between  the  ID  and  OD  is  much 
less  during  the  slow  quench* 

Figures  10  and  11  show  some  of  the  stress  results  due  to  the  experimental 
phase  transformation  curve  and  a  fast  quench  cycle*  This  is  perhaps  the  worst 
case  as  far  as  high  stress  gradients  and  residual  stresses  are  concerned*  The 
distribution  of  tangential  stresses  throughout  the  wall  is  shown  in  Figure  10 
for  specific  times*  At  time  121  seconds  the  transformation  has  begun  on  the 
OD,  and  at  time  150  seconds  it  has  begun  on  the  ID*  The  effect  on  the 
stresses  is  due  primarily  to  the  volume  change  associated  with  the 
transformation.  This  can  easily  be  seen  by  considering  the  stresses  prior  to 
the  transformation  beginning  and  the  changes  to  the  state  of  stress  after  it 
has  been  completed*  Figure  11  shows  the  fluctuation  of  the  stresses  at  the  ID 
and  OD  of  the  tube  throughout  this  quench  cycle*  Again  the  breaks  In  the 

curve  are  due  to  the  onset  of  transformations,  the  onset  of  inelastic  material 

behavior,  and  changes  in  slope  of  the  transformation  curve*  High  tensile 
stresses  are  found  at  the  bore.  Figures  12  and  13  show  the  results  for  the 
experimental  transformation  curve  and  the  slow  quench.  Since  it  was  thought 
that  the  desired  material  microstructure  would  be  developed  even  during  a  slow 
quench  for  a  specific  steel,  the  combination  of  loads  was  run-  From  Figure  12 
one  can  see  that  the  initial  thermal  stresses  are  small,  as  the  thermal 
gradients  are  much  less  during  the  slower  quench*  The  transformation  on  the 
OD  started  about  460-470  seconds  into  the  run  and  on  the  ID  probably  about 
40-50  seconds  later.  The  residual  state  of  stress  is  shown  at  the  end  of  the 

quench  cycle*  High  residual  stresses  are  found  at  both  the  ID  and  OD  but  do 

not  appear  to  penetrate  into  the  interior  of  the  tube  section  as  they  did 
during  the  rapid  quench  cycle.  Bore  tangential  stress,  however,  is  still 
tensile*  Figure  14  shows  the  history  of  the  tangential  stress  at  the  bore  and 
OD.  Most  of  the  action  occurs  during  the  period  from  450-625  seconds  when  the 
material  is  undergoing  the  transformation. 

By  subjecting  the  material  to  more  time  at  the  austenitizing  temperature 
or  perhaps  by  the  changing  the  austenitizing  temperature,  it  may  be  possible 
to  change  the  shape  of  the  phase  transformation-temperature  curve-  For  this 
reason,  it  was  decided  to  look  at  the  type  of  results  that  would  arise  if  the 
modified  transformation  curve  in  Figure  2  was  used*  Figures  14  and  15  show 
the  results  for  this  curve  with  the  first  quench  cycle-  While  the  high 
stresses  do  occur  on  the  ID  and  OD,  the  penetration  of  these  stresses  into  the 
interior  is  not  as  much  as  it  was  with  the  experimentally  determined 
transformation  curve.  Figures  16  and  17  depict  the  results  for  the  modified 
transformation  curve  and  slow  quench*  In  this  case,  the  residual  stresses  are 
small* 


VI.  CONCLUSIONS*  Using  experimental  results  as  a  guide,  this  work 
looked  at  the  residual  stress  states  in  a  long  hollow  tube  which  arise  when 
the  cylinder  is  quenched*  These  stresses  are  due  to  the  transient 
temperatures  during  quenching  and  the  material  transformation  that  occurs. 
High  tensile  stresses  occurred  at  the  bore  for  all  runs*  These  are  the 
stresses  that  can  lead  to  quench  cracking* 
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In  all  runs,  the  stresses  that  occurred  due  to  the  phase  change  were  much 
more  severe  than  those  due  to  the  thermal  gradients*  During  the  slow  quench, 
the  thermal  stresses  were  very  small  by  comparison*  For  the  more  rapid 
quench,  although  more  severe,  they  still  were  much  less  than  the 
transformation  stress-  One  should  recall  that  the  cylinder  modeled  is  a  steel 
one  and  the  thermal  conductivity  is  high  tending  to  keep  the  gradients  small. 
Varying  the  quench  cycle,  e-g.,  introducing  bore  quench,  would  not 
significantly  change  the  stresses  due  to  the  temperature  gradients  alone. 

The  stresses  due  to  the  transformation  cause  inelastic  material  behavior, 
almost  from  the  time  the  transformation  begins*  At  the  higher  temperatures, 
it  is  expected  that  yielding  could  occur  without  generating  cracks  due  to  a 
more  ductile  response.  A  slow  quench  with  the  experimentally  determined 
transformation  curve  or  either  quench  with  the  modified  transformation  curve  ^ 
shows  more  shallow  areas  on  the  ID  and  OD  for  high  residual  stresses.  Less 
material  is  thus  subject  to  high  stresses.  Although  this  analysis  cannot 
predict  the  onset  of  cracks  due  to  quenching,  it  would  tend  to  indicate  that 
fast  quench  and  experimental  transformation  curve  would  have  a  higher 
propensity  for  cracking.  Either  modifying  the  transformation  curve  or  slowing 
the  quench  would  help  decrease  the  possibility  of  quench  cracking. 
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PROBLEM  GEOMETRY 


CHANGE  IN  LINEAR  EXPANSION  COEFFICIENT 
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PIGURE  2. 
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FIGURE  5 


TRANSIENT  TEMPERATURES  IN  TUBE 
DURING  QUENCH 
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FIGURE  6. 
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TANGENTIAL  STRESS  vs  RADIUS  FOR  SPECIFIC  TIMES  DURING  QUENCH 
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FURTHER  INVESTIGATION  OF  THE  STABILITY  OF 
DIFFUSION  FLAMES  NEAR  EXTINCTION* 


Y.S.  Choi  and  G.S.S.  Ludford 
Department  of  Theoretical  and  Applied  Median Icb 
Cornell  University,  ittiuca,  NY  14853 


ABSTRACT .  For  fixed  1^  (the  fuel  Lewis  number)  equal  to  I,  strong 
dependence  of  the  exchange  of  stability  point  on  (the  oxidant  Lewis 
number)  is  found  near  extinction.  On  the  other  hand,  with  i^  =  1  the 
effect  of  varying  on  tlie  location  of  the  point  Is  feiund  to  be  minute. 

1.  INTRODUCTION.  In  a  paper  [1]  at  the  last  Army  Conference,  we 
described  an  Investigation  of  both  the  near-extinction  and  near-ignition 
stability  of  diffusion  flames.  The  emphasis  was  on  Lewis-number  effects 
and,  for  flames  near  extinction,  the  results  were  of  a  preliminary  nature. 
The  ones  presented  here,  though  not  definitive,  bring  the  story  up  to  date. 
The  investigation  continues  and  we  expect  the  remaining  difficulties  to  be 
be  overcome  soon. 


2.  GOVERNING  EQUATIONS  FOR  NEAR-EXTINCTION  ANALYSIS.  The  near¬ 
extinction  steady  states  (corresponding  to  the  bottom  half  of  the  S-responoe 
in  figure  1)  have  been  described  in  [2].  The  perturbation  temperature 
satisfies  the  differential  equation 


d^t 


=  -Ki 


oLp(k^5+k2-t^)(k3C+k^-t^)e 


(1) 


Figure  1.  S-shaped  response  for  chambered  diffusion  flame:  R  is  the 
fraction  of  unburnt  fuel  and  M  is  the  Injection  rate. 

^Supported  by  the  U.S.  Army  Research  Office. 
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and  the  boundary  conditions 


+  0(1) 

tg  -  as  C  ^  (2) 

k^S+kj^C  +  0(1) 

Here  kj^  is  a  known  positive  constant,  k^  =  PC,  k^  is  a  known  negative 
constant,  k^  =  QC,  where  P,  Q  are  known  constants  and  B,  C  are  unknown 

constants;  the  constant  K  is  positive.  The  numerics  determine  exactly 

two  solution  for  each  K  greater  than  a  certain  K*(L_,I.„),  exactly  one 

U  F 

for  K  =  and  none  for  K  <  K*. 


From  these  solutions  the  two  responses 

R  -  -6  C, 
a 

where  6  <<  1  is  a  (knoxm)  small  positive  constant,  can  be  calculated 

a 

for  each  K  >  K*,  thereby  generating  the  middle  and  lower  branches  of  the 
S-shaped  response  curve  in  figure  1.  The  bend  is  approached  as  K  K* 
and  remote  parts  of  the  two  branches  as  K  ->■  «>, 


The  corresponding  stability  problem  is 
t 


=  Ke  ®(yos<*’F'^yFs‘<’9'^Wos**’T^’ 


dr 


=  L 


_1 


0 


0  dt 


-  --F  - 


2  ^“^F 


(3) 


<|),j,(±“)  =  <)'q(±«’)  =  4>p(±®)  =  0, 


(4) 


where 


are  the  mass  fractions  corresponding  to  the  steady  state  temperature 

perturbation  t  .  Here  X  is  the  eigenvalue;  if  the  spectrum  has  non- 
s 

negative  real  part  for  a  steady  state  t  ,  that  state  is  stable  to  the 

s 

class  of  disturbance  considered;  otherwise  it  is  unstable. 

3 .  NUMERICAL  RESULTS .  In  our  previous  paper  [1],  we  reported 
preliminary  results  of  an  extinction  analysis.  Here,  a  more  accurate 
calculation  is  being  recorded. 
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For  L  =  L  =  1,  more  careful  computation  shows  agreement  with  Buckmaster. 
U  r 

Nachman  and  Teliferro^s  result  ([3])  i*e.,  change  of  stability  occurs  at 
the  turning  point  of  steady  states. 

With  general  have  to  deal  with  3  second-order  simultaneous 

equations  on  a  doubly  infinite  domain  in  order  to  extract  the  required 
eigenvalue.  Computations  show  that  although  the  steady-state  response 
can  generally  be  obtained  to  a  higher  degree  of  accuracy  with  a  reasonable 
numerical  infinity,  the  corresponding  stability  problem  requires  a  much 
larger  numerical  infinity  for  convergence  of  eigenvalues.  In  some  cases, 
we  even  have  difficulty  in  getting  the  eigenvalue  converged.  (For  small 
Lq,  for  example  0.1,  we  even  had  difficulty  in  getting  the  steady-state 

response,  because  C  is  so  small.) 

From  these  numerical  experiments,  it  is  found  that  the  real  part  of  the 
smallest  eigenvalue  decreases  as  the  numerical  infinity  is  increased. 

Hence,  once  a  negative  eigenvalue  is  obtained  on  the  lower  branch,  we  may 
expect  the  location  of  the  change  of  stability  point  to  remain  on  the 
lower  branch  when  more  accurate  computations  are  made. 

With  L  fixed  at  1,  for  very  large  L  the  change  of  stability  point  is 
r  U 

on  the  lower  branch.  As  we  gradually  decrease  to  1,  the  point  moves 

to  the  turn.  Further  decrease  of  makes  the  change  of  stability  point 

move  back  to  the  lower  branch  until  a  certain  critical  value  is  attained 
which  we  conjecture  to  be  ^^o^crit  ”  [2]).  As 

we  now  decrease  further  still,  the  change  of  stability  point  moves  towards 
the  turn  again.  For  about  0.13,  it  is  very  close  to  the  turn.  How¬ 

ever,  numerical  difficulties  prevent  us  from  getting  information  for 
smaller  values  of 

On  the  other  hand,  with  =  1  and  L  varied,  the  effect  on  the 

U  J? 

change  of  stability  point  is  very  minute,  at  least  for  the  few  cases 
that  have  been  tested. 

Re-examination  of  the  numerical  procedure  is  in  progress.  Some 
theoretical  results  and  ideas  have  also  been  developed.  We  hope  to  give 
a  better  justification  of  the  numerical  results  obtained  in  the  near 
future . 
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COMPLEX  KINETICS  IN  FLAME  THEORY’^ 


G.S*S.  Ludford  and  Richard  Y*  Tam 
Department  of  Theoretical  and  Applied  Mechanics 
Cornell  University 
Ithaca,  NY  14853 


ABSTRACT .  The  Zeldovich-Linan  model  with  two-step  kinetics  is  chosen 
to  illustrate  the  complexities  involved  and  the  methods  used  for  complex 
kinetics.  A  systematic  procedure  due  to  Fife  and  Nicolaenko  characterizes 
each  reaction  by  its  power  function,  i.e.  the  part  of  the  reaction  source 
term  that  does  not  depend  on  the  mass  fractions.  Of  particular  interest  are 
the  cross-over  temperature  T^,  at  which  the  two  power  functions  are  equal, 

and  its  relation  to  the  upper  the  lower  bounds,!  and  T^,  of  the  burnt 

u  i 

temperature,  which  of  course  depend  on  the  state  of  the  fresh  mixture.  The 
possibilities  T  <T<T,T<T  <T,T<T  <T  lead  to  three 

C  j6u36  CU^'UC 

different  flame  structures,  in  which  the  flame  temperature  is  T„ ,  T  ,  T 

^  c  u 

respectively.  Once  the  flame  temperature  has  been  determined  the  analysis 
follows  that  in  the  familiar  asymptotic  treatment  of  one-step  kinetics. 

INTRODUCTION .  The  one-step  kinetic  model  has  served  combustion 
modelling  well.  However,  some  important  phenomena,  e.g.  flame  quenching, 
are  not  reproducible  by  the  one-step  model.  The  natural  extension  is  there¬ 
fore  to  multi-step  kinetics  which  involve  intermediate  products  called 
radicals.  It  has  long  been  known  that  free  radicals  can  exert  considerable 
influence  on  combustion  processes.  A  simple  two-step  model  that  involves 
one  intermediate  species  is  the  Zeldovich-Linan  model; 

A  +  X  ^  2X 
2X  +  M  P  +  M 

consisting  of  a  chain-branching  (production)  step  and  a  chain-breaking 
(recombination)  step.  Here  X  is  the  radical,  A  the  reactant  and  M  a 
third  body.  The  activation  energy  of  the  production  step  is  very  large, 
while  that  of  the  recombination  step  is  small  and  thus  taken  to  be  zero. 

We  shall  use  this  model  to  illustrate  a  systematic  method  of  evaluating 
complex  kinetic  schemes  that  is  due  to  Fife  and  Nicolaenko  (see  Nicolaenko, 
1985) . 


(1) 
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dx 
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Here  T  is  the  temperature,  while  X  and  Y  are  the  mass  fractions  of  radical 
and  reactant  (respectively) ;  K  and  L  are  the  Lewis  numbers  of  the  radical 
and  reactant; 

Vj^  = 

where  and  are  the  rate  constants  of  the  reaction  steps; 

proportions  of  the  total  heat  released  in  the  first  and  second 
the  reaction,  so  that 


and  are  the 
steps  of 


Hi  +  q2  =  1; 

0  is  the  activation  energy  of  the  first  step,  that  , of,  the  second  step 
being  taken  zero.  Given  the  parameters  L,  K,  and  D^, 

the  problem  is  to  determine  the  burning  rate  M  for  which  these  differential 
equations  have  a  solution  satisfying  the  boundary  conditions 

X,  Y,  T  ^  0,  Y^,  as  x  -«>, 

X,  Y,  dT/dx  ^0  as  X  ->  -Ho. 

The  solution  is  sought  in  the  limit  6  -»■  <»  . 

POWER  FUNCTIONS.  Integrating  the  equations  with  respect  to  x  from 
to  -Ho  and  noting  that  the  derivatives  at  both  ends  tend  to  zero,  we  get 

^f  "  “l*  ^  "  “l  "  “2*  ’’b  “  \  "  '‘l^l  ‘^2“2’ 

where 

°°  /  ^  o 

=  f  ^  "^dx, 

—CO  — OO 

Note  that  0  :!  which  at  the  two  ends  of  the  inequality  yields 

-  Tj  +  qjYj,  =  Tf  +  Iff! 


these  two  values  of  T-  depend  on  the  fresh  state.  Of  course,  we  have 

b 

T  <  <  T  ,  where  T*  is  the  flame  temperature. 

The  part  of  the  reaction  source  term  that  does  not  depend  on  the 
mass  fractions  is  called  a  power  function,  which  we  denote  by 


-1  = 


“2  "  '’2- 


The  cross— over  temperature  is  that  temperature  at  which  the  power  functions 
are  equal,  l.e. 
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»  D_  or  T  =  e/(S,nD,-JLnD  ). 

z  c  X  z 


-e/T 


C 


Thus,  in  contrast  to  the  two  values  of  the  burnt  temperature  found  above, 
the  cross-over  temperature  depends  on  details  of  the  reaction  such  as 
activation  energy,  rate  constants,  etc* 


Consider 


rOO  2 

/  H^X  dx 

— OD 

/"H^XYdx 

— 'GO 


and  note  that  must  be  no  smaller  than  a2  (since  must  be  nonnegative)* 

Suppose  >  D^*  so  the  curves  of  the  power  functions  and  intersect 

to  define  the  cross-over  temperature  as  shown  in  fig*  1*  In  the  limit 

e  00,  for  which  all  the  first  reaction  is  concentrated  at  a  flame  sheet, 
there  are  three  possible  relations  between  the  flame  temperature  and  the 
cross-over  temperature  T^,  in  the  discussion  of  which  we  shall  use,  <<  to 

denote  exponentially  smaller  than. 

(i)  T .  <  T  :  Then  <<  H.  and  the  only  way  to  satisfy  the  inequal- 
”  c  i  z 

ity  _<  1  is  to  take  X  <<  1  everywhere.  This  implies 

Xl  =  0  and  =  Y  ,  so  that  X .  =  T,  =  T  ,  Recombination  is 
Z  t  ^  b  u 

completed  within  the  flame  sheet, 

(ii)  T^  >  T^:  Then  and  a2  «  ot^;  no  restriction  is  imposed 

on  the  order  of  magnitude  of  X.  Consequently,  we  have  -  0, 

from  which  follows  X-  =  Y _  and  T .  =  T.  =  T„ *  Radicals  remain 

of  *  b 

at  X  =  -h»,  and  recombination  takes  place  over  exponentially  long 
distances,  i.e.  downstream  of  what  may  be  called  the  flame  proper* 

(iii)  T.  =  T  :  H„  being  comparable  (exponentially)  to  H  ,  the  recombin- 
c  z  1 

ation  reaction  must  go  to  completion  on  the  x-scale,  and 

consequently,  =  0,  =  Y^  and  T^  <  '  1^* 

THE  FIVE  POSSIBLE  FLAME  STRUCTURES,  Now,  there  are  three  possible 
orderings  of  the  cross-over  temperature  T  and  the  burnt-temperature  bounds 
T.  and  T  (two  additional  limiting  cases  can  also  be  identified).  The 

Jo  U 

three  possibilities  may  be  characterized  by  the  terms  (a)  fast,  (b)  slow, 
and  (c)  intermediate  recombination.  They  occur  in  the  following  circum¬ 
stances  (see  Figure  2) . 

(a)  T.  <  T  <  T  .  Case  (i)  applies  and  the  flame  temperature  has  to 
Jo  u  c 

be  at  its  upper  bound; 
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^  ^  ^u*  applies  and  the  flame  temperature  has  to 

be  at  its  lower  bound; 


(c)  <  T^,  Assuming  either  case  (1)  or  (ii)  leads  to  a  contra¬ 

diction,  so  case  (ill)  applies  and  the  flame  has  to  be  at  the 
cross-over  temperature. 


These  results  are  summarized  by  saying  that  is  driven  as  close  as 
it  can  be  to  T^.  Determination  of  the  flame  temperature  is  a  crucial  step 

in  solving  the  problem.  The  asymptotic  analysis  is  now  similar  to  that  for 
the  one-step  reaction,  and  leads  to  T,  X,  Y  profiles  sketched  in  figure  2, 
as  well  as  formulae  for  the  burning  rate.  It  is  found  that 


The 


M 

M 


for  fast  recombination, 
for  slow  recombination. 


1/2  -2 

M  ot  Dj^  6  e  for  intermediate  recombination. 


two  limiting  cases. are: 


(d)  T  <  T  =  T  .  For  this  fast/intermediate  recombination,  the 

A'  u  c 

profiles  are  similar  to  those  for  the  fast  recombination  case 
except  that  the  radical  concentration  is  much  larger  (but  still 
confined  to  the  flame  sheet) . 

(e)  T  =  Tj  <  T  ,  For  this  intermediate/ slow  recombination,  the 

c  n 

profiles  are  similar  to  those  for  the  slow  recombination  case 
except  that  the  recombination  is  completed  by  x  = 


CONCLUDING  REMARKS.  It  is  now  clear  that  for  ^  (^) 

occur.  But,  for  ^  ^2  have  supposed  above)  there  are  five 

possibilities  depend.lng  on  the  state  of  the  fresh  mixture* 


Fife  and  Nicolaenko  have  considered  a  whole  range  of  multistep  kinetic 
schemes.  Their  method  is  currently  applied  to  the  4-step  model  of  Peters 
&  Smooke  (1985),  which  has  considerably  more  complexity.  Now,  apart  from 
the  24  different  ways  of  ordering  the  four  reaction  rate  constants 
D^,  D^,  D^,  D^,  there  will  be  many  cross'-over  temperatures.  Which  of  these 

the  flame  temperature  is  driven  close  to  is  not  as  clear  as  for  the  two- 
step  case.  But  the  problem  may  hot  be  as  complex  as  it  first  appears. 
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ABSTRACT 

The  method  of  front  tracking  is  applied  to  problems  involving  curved 
detonation  fronts,  surface  instabilities  and  two-dimensional  Riemann  prob¬ 
lems.  The  detonation  problems  include  detonation  fronts  with  and  without 
cylindrical  symmetry;  comparisons  with  one-dimensional  models  are  made, 
llie  analysis  of  interface  instabilities  focuses  on  the  compressible  Rayleigh- 
Taylor  instability  of  a  supersonic  accelerated  contact  discontinuity  between 
two  gases  and  the  propagation  of  a  supersonic  slab  jet.  Theoretical  notions 
for  an  S  matrix  theory  for  general  multi-dimensiond  hyperbolic  conservation 
laws  and  the  numeri^  implementation  of  computer  programs  which  solve 
certain  two-dimensional  Riemann  problems  are  also  discussed. 


1.  Introduction 


Systems  of  non-linear  conservation  laws  in  n  space  dimensions 

o,  +  V  /(x,  u)  =  0 
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where  n  =  o(*,  t)  and  /  is  a  smooth  function  of  the  state  o  t  R*  and  the  position  x  e  R"  into 
R^,  are  often  used  as  first  order  approximations  for  many  natural  phenomena.  Equations  of 
this  type  occur  in  models  in  which  external  forces  and  higher  order  effects  such  as  viscosity 
and  heat  conduction  are  neglected. 


We  are  primarily  concerned  with  the  case  where  system  (1.1)  consists  of  the  Euler 
equations  for  a  compressible,  invisdd,  non*heat  conducting  gas.  In  this  case: 


u  = 


and  /"(u)  = 


m 

m®m 


+P 


(1.2) 


Here  p  is  the  density,  m  and  E  are  the  momentum  and  total  energy  per  unit  volume  respec* 
lively  and  p  is  the  thermodynamic  pressure.  These  equations  represent  the  laws  of  conserva¬ 
tion  of  mass,  momentum  and  energy  respectively.  The  thermodynamic  variables  p,  p  and  E 
are  related  by  a  caloric  equation  of  state  p  =  p(p,  £).  For  the  case  of  a  poly  tropic  gas  this 
relation  is  given  by: 


P  -  (7-I) 


E  - 


2p 


where  y  is  positive  constant  usually  satisfying 


Much  progress  has  recently  been  made  in  adapting  a  front  tracking  method  to  the  calcu¬ 
lation  of  solutions  of  system  (1.1)  which  contain  discontinuities.  In  this  method  a  one¬ 
dimensional  grid  is  placed  onto  the  discontinuity.  Points  on  the  tracked  front  are  propagated 
by  solving  one-dimensional  Riemann  problems  in  the  direction  normal  to  the  interface.  This 
step  provides  the  position  of  the  tracked  interface  for  the  next  time  step.  Tangential  informa¬ 
tion  is  ignored  during  the  normal  propagation  phase,  so  this  step  is  followed  by  a  update  of 
the  states  on  the  new  interface  based  on  the  component  of  (1.1)  tangent  to  the  interface.  The 
positions  of  the  new  and  old  fronts  together  with  their  assigned  states  are  used  as  boundary 
value  data  for  the  solution  of  the  states  off  the  front.  A  detailed  description  of  this  method 
can  he  foimd  in  [1]. 

The  discontinuities  supported  by  tlie  Euler  equations  (1.2)  are  of  two  types,  shocks  and 
contact  discontinuities.  If  combustion  is  considered,  a  third  type  of  discontinuity,  a  combus¬ 
tion  wave,  may  also  occur. 

In  this  paper  we  will  report  on  recent  progress  which  has  been  made  by  the  authors  and 
co-workers  in  modeling  solutions  in  two  space  dimensions  of  the  Euler  equations  for  detona¬ 
tion  waves,  surface  instabilities  for  non-combustion  interactions  and  the  numerical  solution  of 
certain  two-dimensional  Riemaiui  problems.  In  addition  recent  theoretical  work  concerning  a 
general  theory  of  elementary  waves  and  Riemann  solutions  for  systems  of  hyperbolic  conser¬ 
vation  laws  will  also  be  discussed. 


2.  Multi-dlinciisional  RlemaoD  Problems  and  Elementary  Waves 

While  the  theory  of  hyperbolic  conservation  laws  in  one  space  dimension  is  highly 
developed,  the  corresponding  theory  for  two  or  more  space  dimensions  is  not  so  well  under¬ 
stood.  Recent  work  has  been  devoted  to  the  development  of  some  basic  notions  which  can 
be  used  in  an  S  matrix  theory  for  systems  of  hyperbolic  conservation  laws  in  more  than  one 
space  dimension,  see  [2,3,4], 

Let  A  =  —  be  the  Jacobian  matrix  of  f.  Equation  (1.1)  is  said  to  be  hyperbolic  in  a 
domain  D  C  R’'^^  if  the  « x  j  matrix  A  |  has  real  eigenvalues  .  ,  .  ,X^  for  all  (x,  r)  e  £) 
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and  for  all  vectors  | .  If  the  eigenvalues  are  all  distinct,  the  equation  is  said  to  be  strictly 
hyperbolic.  In  die  discussion  which  follows,  hyperbolidty  is  assumed. 

An  S  matrix  theory  is  concerned  with  the  large  time  asymptotic  behavior  of  solutions  to 
systems  of  equations  for  which  system  (1.1)  is  a  first  approximation.  The  leading  order 
terms  of  these  large  time  solutions  are  governed  by  the  infinite  scaling  limit  of  the  original 
system  of  equations.  This  scaling  generally  eliminates  the  higher  order  effects,  yielding  the 
system  (1.1).  It  is  assumed  that  any  source  terms  in  the  original  equation  are  of  bounded 
extend.  Under  scaling  these  source  terms  will  in  general  survive  and  go  into  a  multiple  of  a 
delta  function  at  the  origin. 

An  S  matrix  is  the  product  of  two  wave  operators,  the  outgoing  operator  which 
gives  the  large  positive  time  asymptotics  and  the  incoming  wave  operator  W~  which  gives  the 
large  negative  time  asymptotics.  Attention  will  be  focused  on  the  outgoing  wave  operator 
W*".  The  domain  of  usually  taken  to  be  the  range  of  W~ .  However,  because  of  the 
occurrence  of  shocks  in  solutions  of  (1.1),  this  equation,  when  supplemented  by  the  necessary 
entropy  condition  to  separate  physical  from  nonphysical  waves,  is  not  reversible.  Thus  W~  is 
not  well  defined.  As  a  substitute  the  domain  of  will  be  restricted  to  scale  invariant  func¬ 
tions.  Thus  we  consider  solutions  to  the  initial  value  problem  for  system  (1.1)  whose  initial 
data  is  constant  on  rays  through  the  origin.  In  some  cases  it  will  also  be  desirable  to  impose 
regularity  conditions  on  the  initial  data  as  well.  In  one  space  dimension  this  is  the  well 
known  Riemann  problem  and  the  problem  of  solving  (1.1)  with  scale  invariant  data  will  be 
referred  to  as  a  multi-dimensional  Riemann  problem. 

The  notion  of  dimensionality  in  a  Riemann  problem  is  actually  best  described  in  terms 
of  a  co-dimension.  A  Riemann  problem  of  co-dimension  j  is  defined  as  the  Cauchy  problem 
for  a  system  of  conservation  laws  in  d  space  dimensions  in  which  the  data  is  scale  invariant  in 
j  dimensions  and  independent  in  the  remaining  d  —  j  dimensions. 

The  restriction  to  scale  invariant  data  and  the  fact  that  equation  (1.1)  is  itself  scale 
invariant  implies  that  a  solution  to  a  Riemann  problem  should  be  self-similar,  that  is,  a  func- 

tion  of  This  implies  that  a  Riemann  solution  d  of  (1.1)  will  satisfy 

-  ^-  Vo  V  p  =  0.  (2.1) 

Such  a  solution  n  is  completely  determined  by  its  values  in  the  hyperplane  r  =  1  and  by  res¬ 
tricting  our  attention  to  this  hyperplane  time  can  be  eliminated  from  the  equation.  Therefore 
a  Riemann  solution  of  (1.1)  has  in  general  one  less  degree  of  freedom  than  a  general  solu¬ 
tion. 

General  solutions  for  Riemann  problems  are  known  in  a  few  special  cases.  If  the 
number  of  space  dimensions  is  one,  ^e  system  is  strictly  hyperbolic  and  each  eigenvalue  is 
either  genuinely  non-linear  or  linearly  degenerate,  then  the  classical  paper  of  Lax  [S] 
describes  the  solution  of  a  Riemaim  problem  with  a  small  discontinuity  as  consisting  of 
shocks,  centered  rarefactions  and  contact  discontinuities.  If  the  equation  is  scalar  and  in  two 
space  dimensions,  solutions  are  known  in  the  case  where  F  is  of  the  form  f^,  where  /  is  a 
scalar  valued  function  with  at  most  one  inflection  point  and  is  a  constant  vector  [6,7]. 

A  central  aspect  of  a  scattering  theory  is  that  a  source  decomposes  into  some  number  of 
localized  coherent  waves,  which  then  separate  and  propagate  away  from  each  other.  These 
local  disturbances  are  called  d— dimensional  elementary  waves  if  the  equation  (1.1)  is  in  d 
space  dimensions.  When  d  equals  one  these  elementary  waves  indude  shocks,  contact 
discontinuities  and  rarefaction  waves.  For  d>  1  elementary  waves  are  defined  by  the  interac¬ 
tion  of  lower  dimensional  waves,  for  example  when  two  shocks  or  a  shock  and  a  contact 
discontinuity  collide.  In  many  cases,  such  as  the  interaction  of  two  shock  waves,  these  waves 
will  move  with  a  definite  velodty.  Assuming  that  the  original  equation  (1.1)  is  invariant 
under  Galilean  boosts  one  can  then  make  a  translation  to  a  reference  frame  in  which  the 
wave  is  at  rest,  thus  eliminating  one  more  degree  of  freedom  from  the  equation. 
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Understanding  the  structure  of  these  elementary  waves  is  crudal  in  describing  the  solu* 
tion  of  a  Riemann  problem.  In  some  special  cases  tUs  structure  is  known.  As  mentioned 
above,  in  the  case  of  one  space  dimension,  hyperbolidty  and  a  suitable  type  of  convexity  for 
the  flux  function,  a  theory  of  Lax  describes  Aese  elementary  wave  as  consisting  of  shocks, 
contact  discontinuities  and  rarefaction  waves  which  correspond  to  the  eigenvalues  of  the  dif> 
ferential  of  the  flux  function.  If  the  equation  is  scalar,  then  a  general  theory  of  Oleinik  [8] 
describes  these  waves  in  terms  of  the  convex  envelope  of  the  assodated  flux  fimction.  Other 
spedal  cases  in  which  the  structure  of  elementary  waves  is  known  include  one  space  dimen* 
sion  polytropic  gas  dynamics  [9],  two  space  dimension  polytropic  gas  dynamics  [10],  adsorp¬ 
tion  with  a  Langmuir  isotherm  [11],  and  water  and  polymer  displacement  of  oil  without 
adsorption  [12,13]. 


3.  Two  Dimensional  Detonation  Fronts 


The  method  of  front  tracking  has  been  applied  to  detonation  waves  in  two-dimensional 
gas  dynamics  [14].  For  problems  which  exhibit  cylindrical  symmetry  comparisons  can  be 
made  between  the  two-dimensional  model  and  a  one-dimensional  model  which  exploits  the 
symmetry  and  the  agreement  between  these  two  methods  is  good.  One  finds  that  as  the  mesh 
of  the  computational  grid  is  refined,  the  two-dimensional  model  converges  linearly  to  the 
solution  given  by  the  model  based  on  cylindrical  symmetry. 

Only  a  polytropic  equation  of  state  is  considered,  so  the  energy  term  E  in  (1.2) 
becomes: 


E  = 


+  P9  + 


Here  q  is  the  the  energy  released  by  the  chemical  reaction  that  occurs  across  the  detonation 
front.  The  Chapman-Jouguet  model  of  detonation  is  used.  In  this  model  it  is  assumed  that 
the  reaction  takes  place  instantaneously  and  that  the  reaction  zone  is  infinitely  thin. 

If  state  0  is  the  unburned  gas  and  state  1  is  the  burned  gas,  the  states  on  the  two  sides  of 
the  detonation  are  related  by: 


where  M  is  the  mass  flux  across  the  front,  and  the  Hugoniot  relation: 

7oToPo  7lT]Pi  f  \  (po-pl) 

70-1  7i-l  2 


(3.1a) 


(3.1b) 


In  the  case  of  a  Chapman-Jouguet  (CJ)  detonation,  the  detonation  wave  moves  at  the 
local  sound  speed  with  respect  to  the  gas  behind  it  [15],  and  the  behind  state  is  completely 
determined  by  the  ahead  state  and  equations  (3.1).  When  the  combustion  front  is  a  strong 
detonation  one  additional  parameter  is  necessary  in  order  to  specify  the  behind  state  from  the 
ahead  state. 

A  series  of  both  strong  and  CJ  detonation  runs  using  grid  sizes  of  5  by  5, 10  by  10,  20 
by  20,  40  by  40  and  80  by  80  have  been  made.  The  contact  discontinuity  behind  the  detona¬ 
tion  front  and  the  detonation  front  itself  were  tracked.  Several  of  these  runs  were  initially 
cylindrically  symmetric  and  in  these  cases  comparisons  were  made  with  a  one-dimensional 
computation  using  the  random  choice  method  with  1500  points  in  the  radial  direction. 

Figs.  3.1  -  3.5  present  the  results  of  a  cylindrically  symmetric  computation  in  which  the 
initial  pressure  ratio  across  the  front  is  100.  The  initial  density  is  uniform  and  the  gas 
releases  92.65%  of  its  internal  energy  upon  combustion.  Figs.  3.1  and  3.2  show  the  positions 
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of  the  contact  (iniier  quarter  drde)  and  the  detonation  (outer  quarter  drde)  at  the  beginning 
and  end  of  the  run  respectively.  C^er  figures  indude  comparisons  of  pressure  profiles  and 
detonation  wave  speed  (see  Figs  3.3  and  3.4).  The  detonation  wave  speed  error  when  calcu* 
lated  with  respect  to  the  one>dimensional  code  is  less  than  0.5%.  Fig.  3.S  shows  the  conver¬ 
gence  of  the  front  traddng  code  to  the  one-dimensional  code  under  mesh  refinement. 

In  addition  to  cylindrically  symmetric  runs  the  front  tracking  code  has  also  bemi  applied 
to  problems  in  which  the  initial  interface  is  elliptical.  If  the  initial  states  are  the  same  as  the 
ones  described  above,  hot  spots  arc  produced  behind  the  front  in  regions  of  small  curvature 
and  cold  spots  in  corresponding  regions  of  large  cirvature.  The  initial  lengths  of  the  major 
and  minor  axes  are  .3  and  .15  for  the  detonation  wave  and  .29  and  .145  for  the  contact.  Fig. 
3.6  shows  pressure  contours  and  the  waves  just  before  the  detonation  wave  breaks  through 
the  boundary  on  a  30  by  30  grid.  The  pressure  is  higher  behind  the  flatter  portion  of  the 
detonation  wave  than  behind  the  rounder  portion  of  the  wave. 


4.  Supersonic  Interface  Instabilities 

Interface  fingering  instabilities  arise  in  a  wide  variety  of  physical  contexts;  inertial  laser 
fusion,  plasma  fusion,  instabilities  of  layers  in  stars,  the  instability  of  laser  accelerated  foils, 
and  astrophysical  jets.  We  have  examined  the  compressible  Rayleigh-Taylor  instability  of  a 
supersonic  accelerated  contact  discontinuity  between  two  gases.  The  computed  solutions  exhi¬ 
bit  a  complicated  set  of  nonlinear  waves  comprised  of  spike  and  bubble  bow  shocks,  terminal 
shocks  within  the  spike  and  bubble,  Kelvin-Helmholtz  roll-up  of  the  spike  tip,  and  contact 
siuiace  waves.  Detailed  analysis  is  given  in  Ref.  [16].  We  have  also  studied  the  propaga¬ 
tion  of  a  supersonic  slab  jet  in  order  to  compare  and  contrast  the  jet  wave  structure  with  that 
of  the  supersonic  accelerated  surface. 

A  compressible  gas  interface  which  is  accelerated  by  a  shock  (the  Meshkov-Richtmyer 
instability  [17, 18])  is  Rayleigh-Taylor  unstable.  If  the  interface  is  accelerated  by  a  gravita¬ 
tional  field,  then  the  interface  is  unstable  when  the  light  fluid  pushes  the  heavy.  The  impor¬ 
tant  features  of  this  instability  caii  be  modeled  by  imparting  an  initial  kinetic  energy  to  the 
contact  discontinuity,  which  subsequently  is  allowed  to  advect  freely.  We  assume  that  the 
problem  is  periodic  in  x  with  reflecting  boundaries  at  the  top  and  bottom  of  the  computa¬ 
tional  region. 

The  problem  can  be  parametrized  in  dimensionless  units  by  the  initial  Mach  number  of 
the  tip  of  the  spike  with  respect  to  the  heavy  gas  and  by  the  initial  density  ratio  p(/p„  (6 
denotes  the  gas  below  the  contact,  a  the  gas  above).  The  dimensional  scales  are  set  by  the 
initial  ambient  pressure,  perturbation  wavelength,  and  initial  ambient  density  of  the  heavy 
gas.  The  polytropic  gas  constant  y  was  set  equal  to  1.4. 

An  interesting  set  of  wave  structures  emerges  from  this  study.  Figure  4.1  portrays  the 
evolution  of  a  Mach  2.8  density  ratio  2  accelerated  surface  at  r  =  0.4.  The  flow  is  initially 
supersonic  in  both  gases.  The  bow  shocks  in  the  lower  gas  have  interacted  to  form  a  single 
shock,  while  the  spike  bow  shock  has  interacted  and  joined  with  its  periodic  neighbors.  The 
spike  exhibits  the  characteristic  Rayleigh-Taylor  roll-up,  and  the  contact  shape  indicates  the 
presence  of  small-scale  surface  instabilities. 

Just  inside  of  the  advancing  spike  a  "terminal"  shock  wave  is  formed.  The  contact  is 
advancing  more  slowly  than  the  heavy  gas  inside  of  the  spike.  A  shock  wave,  preceded  by  a 
rarefaction  wave,  is  formed  as  the  advancing  heavy  gas  is  slowed  down  to  the  contact  velo¬ 
city.  A  similar  terminal  wave  is  formed  in  the  light  gas  inside  of  the  advancing  bubble. 

This  shock  preceded  by  a  rarefaction  wave  pattern  can  be  clearly  seen  in  the  density 
cross  section  plots  in  both  the  supersonic  accelerated  surface  run  (Figure  4.1)  and  the  super¬ 
sonic  jet  run  (Figure  4.2).  Note  that  while  the  jet  terminal  shock  propagates  with  the  head  of 
the  jet  beam,  the  accelerated  surface  terminal  shocks  are  physical  transients  which  decouple 
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from  the  late  evolution  of  the  contact  instability. 

The  compressible  Rayleigh-Taylor  results  differ  from  the  incompressible  case  chiefly  in 
the  formation  of  the  terminal  compression  waves  and  in  the  fact  that  the  spike  exhibits  less 
roll-up.  The  accelerated  surface  problem  differs  from  the  gravitational  instability  in  that  the 
spike  appears  to  attain  a  finite  growth  of  aspect  ratio  approximately  equal  to  2  for  our  range 
of  parameters. 

A  resurgence  of  interest  in  supersonic  jete  has  been  sparked  by  the  observation  of  astro- 
physical  jets  emanating  from  the  cores  of  active  galaxies  and  by  the  subsequent  success  of 
theoretical  [19]  and  computational  analyses  [20]. 

The  evolution  of  a  Mach  3,  density  ratio  10  slab  jet  at  r  =  0.4  is  presented  in  Figure  4.2 
[16].  Ute  jet  was  initialized  by  injecting  gas  at  a  specihed  Mach  number  into  an  ambient  gas 
at  equal  pressure.  The  boundary  conditions  are  through-flow.  The  problem  is  parametrized 
by  the  Mach  number  of  the  jet  with  respect  to  the  jet  gas  and  the  density  ratio  of  jet  to 
ambient  gas.  y  was  set  equal  to  S/3.  The  results  apply  to  jets  from  laboratory  to  astrophysi- 
cal  scales  since  the  problem  is  independent  of  length  scale. 

The  jet  beam  in  our  80x120  grid  computation  is  5  grid  blocks  wide,  while  the  beam  is  20 
grid  blocks  wide  in  the  160x300  grid  computation  of  Norman,  Smart,  and  Winkler  [20]. 

The  density  contour  and  cross-section  plots  in  Fig.  4.2  indicate  the  presence  of  a  bow 
wave  (the  flow  is  subsonic  in  the  ambient  gas)  and  of  a  terminal  shock  near  the  head  of  the 
jet  beam,  preceded  by  a  rarefaction  wave.  This  terminal  shock  system  may  explain  the 
observed  hot  spots  terminating  astrophysical  jets  [20].  The  contact  shape  displays  the  large 
scale  Kelvin-Helmholtz  roll-up  of  this  jet,  and  the  development  of  two-dtoiensional  pinch 
waves. 

The  fact  that  we  get  reasonable  results  with  a  beam  5  grid  blocks  across  illustrates  one 
of  the  advantages  of  the  front-tracking  method.  By  placing  additional  degrees  cf  freedom 
around  the  tracked  contact,  the  method  is  able  to  resolve  the  solution  globally  with  fewer 
degrees  of  freedom  than  required  by  conventional  finite  difference  methods.  The  importance 
of  this  feature  of  the  method  will  become  apparent  when  the  statistical  regime  of  multiple 
fingers  is  considered. 


S,  Numerical  Implementation  of  Elementary  Waves 

Further  work  on  the  development  of  computer  code  for  modeling  elementary  waves  is 
in  progress.  Previous  papers  [10, 1]  have  reported  the  numerical  implementation  in  the  front 
tracking  code  of  the  elementary  waves  known  as  regular  reflection  and  single  Mach  reflec¬ 
tion,  this  section  will  discuss  the  case  of  shock  and  contact  discontinuity  interactions. 

The  simplest  model  of  a  shock  and  contact  discontinuity  interaction  consists  of  an 
incident  shock  wave  colliding  with  a  contact  discontinuity  separating  two  different  gases.  The 
local  result  of  this  interaction  is  a  configuration  we  call  a  diffraction  node.  A  diffraction 
node  consists  of  the  incident  shock  wave,  the  contact  discontinuity  into  which  the  incident 
shock  collides,  a  reflected  wave  which  is  either  a  shock  or  a  centered  rarefaction  wave  and  a 
deflected  contact  surface  behind  the  incident  shock.  The  model  supposes  that  locally  all  of 
the  shock  or  contact  waves  can  be  assumed  to  be  straight,  and  that  Ae  solution  in  a  neighbor¬ 
hood  of  the  node  is  piecewise  constant  except  for  the  possible  reflected  centered  rarefaction 
wave.  It  is  assumed  that  the  point  of  intersection  of  these  waves  moves  with  a  definite  velo¬ 
city  and  thus  the  interaction  can  be  studied  in  a  frame  of  reference  in  which  the  node  is  at 
rest.  The  description  of  a  diffraction  node  then  consists  of  the  states  in  a  neighborhood  of 
the  node  together  with  the  angles  at  which  the  waves  at  the  node  intersect. 

In  a  dynamic  model  it  is  necessary  to  calculate  die  transformation  to  the  ffeady  frame  of 
the  node.  This  is  eqtiivalent  to  finding  the  velocity  of  the  node  in  the  given  reference  frame. 
This  velocity  can  be  approximated  by  propagating  the  incident  shock  and  the  contact 


228 


discontinuity  into  which  it  collides  for  one  time  step  ignoring  their  interaction.  The  intersec¬ 
tion  of  the  two  propagated  curves  is  then  used  as  the  updated  node  position  from  which  the 
node  velocity  can  be  calculated.  Once  the  node  velocity  is  known,  Ae  transformation  to  the 
steady  frame  of  the  node  is  performed.  If  one  assumes  that  the  data  in  front  of  the  inrident 
shock  on  both  sides  the  contact  discontinuity  is  known,  and  the  strength  of  the  incident  shock 
is  given,  then  the  configuration  around  the  diffraction  node  in  the  steady  frame  can  be  found 
by  the  intersection  of  shock  polars  in  the  pressure  turning  angle  space,  see  Henderson  [21]. 
Finally  the  configuration  is  translated  back  to  the  original  frame  of  reference. 

Figure  5.1  shows  the  result  of  the  interaction  of  a  planar  shock  wave  colliding  with  a 
sinusoidally  perturbed  contact  discontinuity.  The  incident  shock  is  in  air  (y  =  1.402)  and  the 
contact  surface  separates  air  from  sulphur  hexafluoride  (y  =  1.092).  This  interaction  is 
known  as  a  fast-slow  interaction  since  the  sound  speed  in  air  is  greater  than  that  in  sulphur 
hexafluoride.  The  initial  shock  has  a  pressure  beMnd  to  pressure  ahead  ratio  of  100.  It  is 
interesting  to  note  the  extreme  proximity  of  the  transmitted  shock  and  the  deflected  contact 
discontinuity  near  the  node.  We  were  quite  pleased  with  the  front  tracking  code’s  ability  to 
resolve  a  configuration  with  such  close  curves.  The  rectangular  mesh  used  for  this  run  was 
20x20,  and  the  separation  between  the  transmitted  shock  and  deflected  contact  is  on  the  order 
of  one  tenth  of  a  mesh  block  for  a  large  portion  of  the  computational  region.  Furthermore 
the  transmitted  wave  lies  on  the  sulphur  hexaflouride  side  of  the  contact  discontinuity  and  the 
value  of  gamma  for  this  gas  is  so  close  to  one  as  to  make  the  resolution  of  waves  on 
moderate  sized  grids  difficult  for  most  finite  difference  methods.  We  suspect  that  without 
front  tracking  one  would  need  a  rectangular  mesh  more  than  ten  times  as  fine  in  each  linear 
dimension  in  order  to  resolve  both  the  deflected  contact  and  the  transmitted  wave. 


6.  BiftircatloDS  and  Wave  Interactions 

One  of  the  principal  difficulties  in  any  front  tracking  code  is  the  handling  of  bifurcations 
and  interactions  in  the  tracked  interface.  Examples  of  these  include  the  transition  from  regu¬ 
lar  to  Mach  reflection  and  the  passing  of  waves  through  computational  boundaries.  Interac¬ 
tions  of  both  of  these  types  have  been  either  fully  or  partially  implemented  in  our  front  track¬ 
ing  code. 

Figure  6.1  shows  a  planar  shock  wave  incident  upon  a  ramp.  The  problem  is  initialized 
with  the  shock  normal  to  the  wall.  When  the  ramp  is  reached,  a  bifurcation  occurs,  in  this 
case  to  a  regular  reflection.  At  a  later  time  the  point  of  regular  reflection  reaches  Ae  top  of 
the  ramp.  At  this  point  the  regular  reflection  node  lifts  off  the  wall  producing  a  Mach  type 
reflection. 

In  figures  6.2  the  front  tracking  code  is  used  to  follow  the  development  of  a  compressi¬ 
ble  Kelvin-Helmholtz  roll-up  [1,22].  Initially  two  gases  of  equal  pressure  and  temperature 
but  moving  in  opposite  directions  are  separated  by  a  slip  discontinuity.  This  slip  surface  is 
given  an  initial  perturbation  which  causes  it  to  roll-up.  The  boundary  conditions  at  the  sides 
of  the  computational  rectangle  are  periodic.  As  the  surface  rolls  up  portions  of  the  surface 
cross  the  periodic  boundaries.  Any  section  of  the  surface  which  propagates  past  a  periodic 
boundary  is  disconnected  from  the  original  curve  and  reinstalled  periodically  shifted  to  the 
opposite  side  of  computational  rectangle.  A  linking  between  the  periodically  connected 
curves  is  maintained  so  that  periodic  boundary  conditions  are  enforced.  The  visual  effect  is 
that  as  one  section  of  the  slip  surface  propagates  out  of  the  computational  window,  we  see 
the  corresponding  portion  of  the  periodic  neighbor  moving  into  our  picture. 

Not  only  is  the  interaction  of  curves  with  computational  boundaries  of  interest,  but  the 
interaction  of  nodes  as  well.  Fig.  6.3  shows  a  diffraction  node  propagating  past  a  computa¬ 
tionally  passive  boundary.  This  problem  is  supersonic  and  the  signals  from  the  exterior  of 
the  right  hand  boundary  are  sufficiently  weak  that  their  influence  on  the  solution  can  be 
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ignored.  Thus  the  problem  of  node  passing  through  such  a  boundary  is  simply  a  matter  of 
identifying  those  curves  at  the  exiting  node  which  leave  and  those  which  remain.  F.Titing 
curves  are  deleted  and  the  remaining  curves  are  separately  installed  on  the  boundary.  The 
main  difficulty  here  is  dealing  with  the  numerical  degeneracies  which  occur  because  very 
short  curves  are  produced  when  the  node  first  crosses  the  boundary. 
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Fig.  3.1 

The  initial  strong  detonation  wave  (outer  drde)  and  the  contact  discontinuity  for  a 
cylindrically  symmetric  computation.  The  initid  conditions  are  uniform  density,  zero 
velocity,  and  a  circular  pressure  discontinuity  at  radius  .2,  with  ratio  inside  to  outside  of 
100.  The  heat  released  upon  combustion  is  92.65%  of  the  internal  energy  of  the 
unburned  gas.  The  initial  position  of  the  contact  is  radius  .195.  The  initial  state 
between  the  waves  is  that  behind  a  planar  detonation  wave  with  the  above  initial  data. 
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Fig.  3.2 

The  detonation  wave  and  the  contact  at  the  time  step  analyzed  in  Figs.  3.3  and  3.4.  The 
detonation  wave  now  has  radius  approximately  .36. 
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Fig.  3.3 

A  plot  of  presure  vs.  radius  corresponding  to  Fig.  3.2  is  shown.  The  solid  curve  shows 
the  results  obtained  by  the  one-dimensional  random  choice  computation.  The  vertical 
lined  represent  the  range  of  pressure  values  in  the  two-dimensional  front  tracking  solu¬ 
tion  at  a  fixed  radius  as  the  angle  varies  on  a  40  by  40  grid.  Thus,  the  vertical  lines 
show  the  angular  dependence  in  the  solution. 
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A  plot  of  detonation  speed  vs.  time  for  the  computation  on  a  40  by  40  grid.  The  solid 
curve  shows  the  speed  of  the  detonation  wave  in  the  one-dimensional  calculation.  The 
vertical  lines  represent  the  range  of  values  of  the  speed  of  the  detonation  in  the  two- 

dimensional  calculation.  The  maximum  error  (max  jj  than  0.5%. 
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Fig.  3.5 

Convergence  of  the  front  and  interior  schemes.  The  pressure  errors  in  the  interior  and 
at  the  front  are  shown  for  NxN  grids  at  the  time  indicated  by  Fig.  3.2.  The  #  signs 
represent  the  interior  error,  where 


Interior  Error  =  100%  x 


Jo  Jo 


The  front  error  (error  bars)  gives  the  range  of  the  errors  at  the  front,  defined  as 


Front  Error  =  100%  x 


[P]  ’ 


where  [P]  is  the  pressure  jump  at  the  front  in  the  one-dimensional  computation  at  the 
same  time.  The  asterisks  represent  the  error  of  the  average  pressure  behind  the  front, 
namely 


Front  Error(average  pressure)  =  100%  x 
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averatt 

[P] 


3.6 

Pressure  contours  are  shown  for  a 
a  30  by  30  grid.  Also  shown  are  the 
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contours  and  density  cross  sections  at  t  »  0.4. 
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ty  contours  and  density  cross  section  for  M  =  3,  D  =  10  jet  at  t  =  0.4.  80x120 


BIFURCATION  AND  STABILITY  OF  VISCOELASTIC  FLUID  FLOWS 


s.  Rosenblat 

Department  of  Mathematics 
Illinois  Institute  of  Technology 
Chicago,  Illinois  60616 


ABSTRACT.  Results  on  stability  of  viscoelastic  fluid 
flows  are  presented,  reviewed  and  discussed;  both  linear 
stability  and  nonlinear  stability  (bifurcation)  are 
considered.  A  central  problem  in  viscoelastic  fluid 
dynamics  is  that  predictions  of  flow  behavior  can  in  general 
be  strongly  dependent,  both  qualitatively  and 
quantitatively,  on  the  choice  of  the  constitutive  equation 
used  to  relate  stress  and  deformation  rate.  This  is  true 
even  for  very  simple  flow  configurations,  and  the 
acceptability  or  otherwise  of  a  constitutive  law  depends  on 
whether  its  predictions  are  in  accord  with  experiment.  It 
will  be  shown  in  this  paper  that  stability  results  are  also 
strongly  correlated  with  the  particular  constitutive 
equation  used,  and  some  implications  of  this  fact  are 
discussed.  Detailed  results  are  given  for  the  problem  of 
buoyancy-driven  instability  in  a  quiescent  horizontal  layer. 

1.  INTRODUCTION.  The  area  of  hydrodynamic  stability 
can  conveniently  be  subdivided  into  three  categories;  these 
subdivisions  are  equally  relevant  to  Newtonian  and  non- 
Newtonian  fluid  flows,  and  apply  to  both  theoretical  and 
experimental  approaches . 

A.  Linear  Stability.  The  object  is  to  investigate  the 
stability  to  infinitesimal  disturbances  of  specific,  well- 
defined  basic  flow  states.  In  particular,  the  question  of 
whether  a  given  flow  is  stable  or  unstable  is  usually  posed 
in  terms  of  dimensionless  parameter,  conveniently  denoted  R 
(which  may  be  a  Reynolds  number,  Rayleigh  number,  Taylor 
number,  etc.),  and  the  aim  is  to  determine  a  critical  value 
R„  of  R  which  divides  stable  and  unstable  flow  patterns. 

In  general,  the  basic  flow  is  (asymptotically)  stable  if 
R  <  Rj,  and  unstable  if  R  >  R^.  This  component  of  the  area 

of  hydrodynamic  stability  is  thoroughly  understood  and 
developed  for  Newtonian  fluid  flows. 
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B.  Bifurcation.  The  object  is  to  investigate  the 
weakly  nonlinear  evolution  into  a  new  state  that  is 
associated  with  the  destabilization  of  a  basic  state.  The 
theory  for  this  aspect  is  well  developed  and  applies  in  a 
small  neighborhood  of  the  critical  value  of  R,  that  is,  for 
|R  -  RqI  small.  Theorems  are  available  that  can  predict 

whether  the  new  bifurcation  state  is  supercritical  (that  is, 
exists  for  R  >  R^,) ,  subcritlcal  (exists  for  R  <  R^)  or 

transcritical  (exists  for  both  R  <  Rq  and  R  >  R^,) ,  and 

whether  it  is  stable  or  unstable.  The  new  state  may  be 
time-independent  or  time-periodic,  depending  on  the 
characteristics  of  the  linear  stability  problem,  and  has  a 
spatial  structure  different  from,  and  more  complex  than,  the 
basic  state. 

C.  Transition.  Most  fluid  flows  experience  transition 
to  turbulence  when  the  characteristic  parameter  R  becomes 
sufficiently  large.  The  problem  of  transition  is  currently 
of  great  interest,  and  some  understanding  of  the  process  has 
been  achieved  in  recent  years.  A  number  of  ideas  are  being 
explored  in  this  strongly  nonlinear  regime,  in  particular 
the  notion  that  there  is  a  pre-turbulent,  chaotic  state  that 
appears  when  R  >  >  R^,.  There  are  several  model  equations 

and  a  body  of  theory  in  dynamical  systems  that  support  this 
idea,  but  the  link  with  real  fluid  behavior  is  only 
partially  established. 

2.  STABILITY  OF  VISCOELASTIC  FLUIDS.  It  is  convenient 
to  characterize  the  elasticity  of  a  non-Newtonian  fluid  by  a 
parameter  X (which  may  denote  a  Weissenberg  number  or  a 
Deborah  number) ;  for  the  purposes  of  this  discussion,  A  =  0 
represents  a  Newtonian  fluid  and  X  >  0  a  viscoelastic  fluid. 
This  is  a  gross  simplification;  it  is  rarely  the  case  in 
practice  that  a  non-Newtonian  fluid  can  be  characterized  by 
a  single  parameter. 

As  regards  stability,  it  is  desirable  to  summarize  the 
most  important  issues  that  arise  in  relation  to  viscoelastic 
fluids  by  comparison  with  their  Newtonian  counterparts. 

A.  Linear  Stability.  There  are  two  Issues, 
quantitative  and  qualitative.  The  former  relates  to  the 
question  of  how  the  critical  value  of  R^  is  changed  when  X  0 

in  other  words,  whether  the  presence  of  elasticity  tends  to 
stabilize  or  destabilize  a  given  flow  field,  or  to  have  no 
effect.  This  problem  has  been  studied  by  a  great  many 
authors  in  different  flow  configurations  (see  for  example 
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the  survey  by  Pearson  [1]).  The  other,  perhaps  more 
important,  question  is  whether  new  instabilities  can  appear 
when  X  ^  0  which  are  completely  absent  when  X  “  0.  It  is  of 
special  significance  to  know  whether  such  instabilities  can 
arise  for  R  <  <  Rq.  If  they  do,  it  may  be  associated  with  a 
release  of  elastic  energy  stored  in  the  basic  flow  to  feed 
the  disturbances.  More  recently  it  has  been  suggested  by 
Joseph  and  co-workers  [2]  that  such  an  effect  can  be  due  to 
the  change  of  type  from  elliptic  to  hyperbolic  of  the 
governing  differential  equations, 

B.  Bifurcation.  The  main  issue  here  is  qualitative, 
whether  the  presence  of  elasticity  alters  the  nature  of  the 
bifurcating  solution.  In  particular  it  would  be  of  interest 
to  know  whether  a  solution  that  is  supercritical  and  stable 
in  the  Newtonian  case  can  become  subcritical  and  unstable  in 
the  corresponding  non-Newtonian  case,  or  conversely.  An 
example  of  this  will  be  discussed  in  detail  below.  Another 
interesting  question  is  whether  the  geometry  in  space  of  a 
bifurcating  solution  is  changed  due  to  viscoelasticity. 

C.  Transition.  Although  most  practical  viscoelastic 
fluid  flows  operates  at  relatively  low  Reynolds  numbers,  it 
would  nevertheless  be  important  to  know  how  the  transition 
to  turbulence  occurs  in  such  fluids.  This  issue  has  hardly 
been  addressed  at  all  and  the  answers  are  unknown. 

3.  CONSTITUTIVE  RELATIONS.  The  greatest  obstacle  to 
understanding  the  behavior  of  viscoelastic  fluid  flows  is 
lack  of  certainty  regarding  constitutive  relations.  For  the 
purposes  of  solving  problems  one  is  obliged  to  use 
relatively  simple  relations  rather  than  general  (e.g. 
functional)  laws,  but  then  one  encounters  the  difficulty 
that  a  suitable  relation  for  a  particular  flow  of  a  fluid 
may  not  be  suitable  for  a  different  flow  of  the  same  fluid. 
In  addition,  of  course,  different  fluids  generally  require 
different  constitutive  laws.  In  the  study  of  viscoelastic 
fluid  stability  there  is  already  evidence  that  predictions, 
including  qualitative  ones,  may  be  relation-dependent,  and 
further  examples  of  this  will  be  given  below.  This, 
however,  should  be  regarded  as  a  positive  rather  than  a 
negative  feature,  since  when  coupled  with  experiments  the 
theory  can  be  used  as  a  yardstick  for  the  validity  or 
otherwise  of  a  particular  constitutive  equation  for  certain 
classes  of  flows.  A  good  example  is  the  work  of  Craik  [3] 
who  showed  that  the  rest  state  of  a  second-order  fluid  was 
unstable,  which  implies  that  the  second-order  model  should 
not  be  used  for  unsteady  flows. 

4.  STABILITY  OF  SHEAR  FLOWS.  There  has  been  much  work 
(almost  exclusively  linear  theory)  on  the  stability  of 
various  kinds  of  rectilinear  and  circular  shearing  flows. 


247 


and  several  surveys  are  available,  including  that  by  Pearson 
[1]  and  Petrie  and  Denn  [4].  A  good  deal  of  this  work  has 
been  motivated  by  practical  problems  relating  to  flows  of 
polymer  solutions  and  polymer  melts. 

A.  .Circular  Couette  Flow.  This  is  the  flow  between 
concentric  rotating  cylinders,  a  classical  problem  in 
Newtonian  fluid  stability,  and  of  relevance  to  the  process 
of  screw  extrusion.  There  have  been  several  calculations  of 
linear  stability  for  various  constitutive  relations  [5-7], 
and  the  overall  conclusion  is  that  for  any  physically 
acceptable  constitutive  model  the  effect  of  viscoelasticity 
is  stabilizing;  in  other  words  the  critical  parameter  (the 
Taylor  number)  has  a  higher  value  than  in  the  Newtonian 
case.  The  onset  of  instability  is  through  the  exchange  of 
stabilities  in  these  analyses.  Under  certain  conditions, 
however,  [8-9]  it  appears  that  overstability  is  possible. 
There  has  been  some  work  on  nonlinear  instability  for  this 
problem  [10],  but  only  for  very  special  constitutive 
equations. 

B.  Plane  Couette  Flow.  This  is  the  simplest 
viscometric  flow,  generated  by  differential  motion  of 
parallel  planes.  In  the  case  of  a  Newtonian  fluid  the  flow 
is  stable  for  all  Reynolds  numbers  to  Infinitesimal 
disturbances.  There  have  been  many  studies  of  the  stability 
of  this  flow  for  various  viscoelastic  models,  with  the  aim 
of  determining  an  instability  at  low  Reynolds  number  but 
dependent  on  Weissenberg  number.  Although  there  are  some 
positive  results  [ii],  it  is  not  at  all  clear  that  there  is 
a  viscoelastic  instability  in  this  flow  for  a  meaningful 
constitutive  relation  [12]. 

C*  Plane  Poiseuille  Flow.  This  is  the  flow  in  a 
channel  under  a  constant  pressure  gradient.  A  number  of 
studies  of  the  stability  of  this  flow  [13-18]  for  various 
constitutive  models  have  indicated  that  in  general  the 
critical  Reynolds  number  is  decreased  by  the  presence  of 
elasticity;  a  destabilizing  effect.  The  degree  of  the 
effect  depends  on  the  model.  There  have  also  been  attempts 
to  identify  the  presence  of  a  new  instability  at  low 
Reynolds  numbers  [18-22],  with  a  view  to  elucidating  the 
important  phenomenon  of  melt  fracture.  Although  such 
instabilities  are  predicted  by  theory,  the  results  are  for 
special  constitutive  models  and  do  not  in  general  agree  with 
any  experimental  results,  leaving  the  question  of  such 
instabilities  open.  Some  work  has  been  done  on  weakly 
r^orilinear  theory  [23—24],  but  the  conclusions  do  not  appear 
to  be  particularly  significant. 

THERMAL  CONVECTION.  To  demonstrate  some  of  the 
issues  raised  earlier,  we  shall  discuss  in  detail  the 
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'thenoal  convection  (Benard)  problem  for  a  viscoelastic 
liquid.  This  is  the  problem  of  a  horizontal  liquid  layer 
heated  from  below;  density  varies  linearly  with  temperature 
and  buoyancy  drives  convection.  In  the  basic  state  there  is 
no  fluid  motion  and  a  linear  temperature  distribution  across 
the  layer.  The  governing  equations  are  Navier-Stokes, 
energy,  continuity  and  a  constitutive  relation.  The  liquid 
is  of  infinite  horizontal  extent,  and  the  parallel 
horizontal  boundaries  are  taken  to  be  isothermal,  non- 
deformable  and  stress-free. 

The  equations  (dimensionless)  for  arbitrary 
disturbances  to  the  basic  state  are 

Pr  ^ (v^  +  v*Vv)=  -Vp  +  V*T  +  Re^  (1) 


V*v  =  0  (2) 

9^  +  v»V0  =  V^9+w  (3) 


where  y  «  (u,v,w)  is  velocity,  0  is  temperature,  p  is 
pressure,  t  is  extra  stress,  t  is  unit  vector  in  the 

vertical  direction,  and  R,  Pr  are  respectively  the  Rayleigh 
nuitiber  and  Prandtl  number  defined  in  the  usual  way,  with  the 
zcro-shear-rate  viscosity  as  the  reference  value.  The 
boundary  conditions  are 


G=:u^  =  v^  =  w=  0  onz  =  0,l.  (4) 

To  investigate  the  consequences  of  various  models,  we  use  a 
hybrid  constitutive  relation  which  includes  several  familiar 
relations  as  special  cases.  We  write 

j+\  [Ktrx  +  Dt  -■  ^a(T*;Y^+Y*T)J  =  i+exj^Dy  -  x*i]  (5) 

where  y  ■  Vv  +  Vv^  is  the  rate  of  strain  tensor,  and  D  is 
the  Jaumann  derivative  defined  by 

D  =  -^  +  v*V  +  -  .w) 

Where  w  =  Vv- Vv  is  the  vorticity  tensor.  Equation  (5)  has 
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four  parameters:  X  is  the  Deborah  number,  a  measure  of 
relaxation  time;  £(0  £  e  <  1)  is  related  to  the  retardation 
time;  a(0  £  a_<l)  delineates  corotational  and 
codeformational  models;  k(£0)  is  a  parameter  obtained  from 
network  theory  arguments.  When  < »  0,  equation  (5)  is  an 
Oldroyd-type  model,  specifically,  a  Maxwell  model  when  e  =  0 
and  a  Jeffreys  model  when  e  0.  The  model  is  corotational 
when  a  =  0  and  upper  convected  when  a  *=  1.  When  k  0 
and  e  =  0  equation  (5)  becomes  the  Phan  Thien-Tanner  model 
[25]. 


We  now  consider  the  various  stability  aspects  of  this 
problem. 

A.  Linear  Stability.  This  problem  was  fully  resolved 
by  Sokolov  and  Tanner  [26].  When  the  nonlinear  terms  in 
equations  (1),  (3)  and  (5)  are  deleted,  and  the  time 
dependence  is  taken  to  be  proportional  to  exp  (a  t) ,  the 
problem  reduces  to  an  eigenvalue  problem  for  the  growth  rate  a 
of  disturbances.  It  should  be  noted  that  in  the  linear 
approximation  the  parameters  a  and  k  are  irrelevant;  only 
the  elasticity  parameters  enter  the  problem.  Loss  of 
stability  occurs  when  a  =  0  (exchange  of  stabilities)  or 
when  Re  a  =  0  (overstability) . 

The  results  can  be  summarized  as  follows.  In  the  case 
of  exchange  of  stabilities  (a  =  0)  the  Rayleigh  number  for 
marginal  stability  is 


^(S)  ,  2  ^  2.3,  2 

R''  =  (ir+a)/a  (6) 

where  a  is  the  wave  ntimber  of  the  disturbance.  The  critical 

value  is  R^®^  -  27itV4  at  a  **  ir^/2.  This  is  exactly  the 

o 

result  for  the  Newtonian  problem,  which  means  that  there  is 
an  instability  that  is  independent  of  elasticity.  In  the 
case  of  overstability  (a  =*  iw)  there  is  marginal  stability 
when 

R^^^  =  R^^^  -  cw^  ^  (cX+l)Pr”^  +  eXcJ  /a^  (7) 

with  , 

0)2  =  cX(l-e)-l-Pr- 

X^(Pr"Se) 
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where  c  =  ir^  +  .  This  result  depends  on  the  elasticity 

parameter  (as  well  as  on  Prandtl  number) ,  and,  as  (8)  shows, 
can  only  occur  if 

X(l-e)  >  (l+Pr"^)/c  .  (9) 


In  other  words,  this  is  a  new  instability  occurring  only  at 
sufficiently  high  elasticity.  It  is  obvious  from  (7)  that 

<  R^®^  at  any  fixed  wave  number  at  which  overstability 
takes  place;  this  means  that  the  first  onset  of  instability 

may  be  periodic  rather  than  steady.  A  ‘graph  of  R^^^  and 
r(P) 

as  a  function  of  wave  number  is  shown  in  Figure  1.  The 

solid  curve  represents  R^®) ,  the  broken  curves  r(P)  for 
various  parameter  values. 


Neutral  stability  curves  for  steady  onset  (solid  lines)  and 
periodic  onset  (broken  lines) . 
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B.  Bifurcation.  A  partial  study  of  this  problem  was 
performed  by  Eltayeb  [27].  Looking  only  for  two-dimensional 
weakly  nonlinear  solutions  we  can  introduce  a  stream 
function  \ij  such  that 


(10) 


The  governing  equations  (1)  -  (3)  reduce  to 

+  J(V^ip,ip)  1+  Re^  -  N  =  0 


(11) 


+  j(e,ij^) 


0 


(12) 


where  J  is  the  Jacobian  and  where  N  is  a  combination  of 
stress  components,  namely 


N 


_3i 

3x3z 


(13) 


The  first  term  on  the  right  of  (13)  is  a  contribution  from 
normal  stress  differences  while  the  second  relates  to  shear 
thinning.  Equations  (10)  -  (12)  have  to  be  supplemented  by 
the  two-dimensional  forms  of  the  nonlinear  constitutive 
relations  (5) , 


The  problem  is  solved  by  perturbation  in  terms  of  a 
small  bifurcation  parameter  y.  We  write 


R  -  R  = 
c 


(14) 


where  is  the  critical  value  for  either  steady  or  periodic 
onset.  A  solution  that  exists  for  R2  >  0  is  called 
supercritical  and  is  called  subcritical  if  it  exists  for 
R2  <0.  All  field  quantities  are  expanded  in  powers  of  y, 
for  example 

Q  =  y0^+y^62'‘'^^®3  +•••  (15) 
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The  quantity  has  the  form 


9  =  A  cos  ax  sin  ttz 


(16) 


where  A  is  the  amplitude,  to  be  determined.  A  standard 
perturbation  procedure  leads  to  a  solvability  condition  at 

order  y  ^ . 

We  summarize  the  results  for  the  case  of  steady  onset 
of  convection.  Details  and  results  for  the  case  of  periodic 
onset  can  be  found  in  [28].  The  solvability  condition  leads 
to  an  equation  of  the  form 

R2A  -  KA^  =0  (17) 


where  K  is  a  constant  that  depends  on  all  parameters  of  the 
problem.  The  bifurcating  solution  will  be  supercritical  and 
stable  if  K  >  0,  and  will  be  subcritical  and  unstable  if 
K  <  0.  In  the  Newtonian  case  it  is  known  that  K  =  p  >0, 
and  the  solution  is  supercritical  and  stable.  In  the  present 
problem  we  compute  an  expression  for  K,  namely 

K  =  p  +  (1-e)  [  m  -(a^-2aK)n]  (18) 


where  m,n  are  positive  numerical  constants.  It  follows  from 
(18)  that  there  are  values  of  the  viscoelastic  parameters 
which  allow  subcritical  bifurcation.  We  find,  for  example, 
that  if  a  >  0.75  (close  to  the  upper  convected  model), 
ic  =  0  and  X  >  0.03  with  e  =  0,  then  subcritical  bifurcation 
occurs, ^  Other  combinations  give  the  same  result,  but  if  a 
is  sufficiently  small  or  k  sufficiently  large  then  only 
supercritical  bifurcation  can  occur. 

Results  such  as  these  may  be  a  partial  test  of  the 
viability  of  a  proposed  constitutive  relation  for  a 
particular  liquid.  Thus  Liang  and  Acrivos  [29]  studied 
onset  of  convection  in  a  polyacrylamide  solution  and  found 
only  supercritical  bifurcation.  This  observation,  coupled 
with  the  formula  (18) ,  would  exclude  certain  models,  or 
certain  parameter  ranges,  for  this  liquid. 
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FIGURE  2 

Schematic  bifurcation  diagram  for  (a)  supercritical  bifurcation  and 
(b)  subcritical  bifurcation.  Solid  lines  represent  stable  solutions 
and  broken  lines  represent  unstable  solutions. 


C.  Transition.  A  formal  study  of  transition  has  not 
yet  been  achieved,  even  in  Newtonian  fluids.  In  recent 
years,  however,  there  has  been  great  interest  in  a  severely 
truncated  Fourier  series  representation  of  the  field 
quantities  that  leads  [30]  to  a  system  of  three  nonlinear 
ordinary  differential  equations  having  interesting 
properties.  These  equations,  known  as  the  Lorenz  system, 
exhibit  aperiodic  (chaotic)  solutions  that  may  be  a  valid 
behavioral  model  of  how  transition  takes  place.  It  is  of 
interest  to  derive  the  analogous  system  for  the  viscoelastic 
problem,  and  to  study  its  properties. 

To  do  this  we  set 

0  =  cos  ax  sin  ttz  +  A2  sin  airz,  if)  =  sin  ax  sin  irz 


N  =  sin  ax  sin  irz 


(19) 


in  the  equations  of  motion  and  the  constitutive  relations, 
and  then  truncate  to  retain  only  those  Fourier  components 
exhibited  in  (19).  The  quantities  A2/  are  time- 

dependent  amplitudes.  The  procedure  leads  to  a  system  of 
four  ordinary  differential  equations,  namely. 


Pr“^cB| 


+  aRA,  +  M, 


0 


A^+cA^  +,aB^  +  T7aA2B^  =  0 
A^+4ir^A2  -  ^-TraA^Bj^  =  0 

Mi+XM|  -  c^(bj^+eXBp  =  0  (20) 


It  is  noteworthy  that  of  the  four  elasticity  parameters  in 
the  original  system,  only  two  (x  and  e)  appear  in  (20) 7  this 
is  due  to  the  form  of  the  truncation.  Also,  when  X  “  0  the 
above  reduces  precisely  to  the  three  equations  of  the  Lorenz 
system. 

The  linear  and  weakly  nonlinear  stability  analysis  of 
the  null  solution  of  (20)  gives  the  same  results  as  obtained 

for  the  full  system  above.  When  X(1  -e)  <  (1  +  Pr"^)/c 
there  is  a  steady  bifurcation  from  R  =  R^,  and  in  this  case 

the  solution  is  supercritical.  When  X(1  -  e)  >  (1  +  Pr”^)/c 
there  is  a  periodic  bifurcation  which  is  also  supercritical 
and  unstable.  The  behavior  away  from  the  neighborhood  of 
the  bifurcation  point  has  to  be  determined  by  numerical 
integration  of  (20) .  The  results  can  be  summarized  as 
follows: 

(i)  When  (1-e)  <  (1+  Pr“^)/c  the  steady  bifurcating 
solution  loses  stability  at  a  value  of  R  that  depends  on  Pr 
and  the  elastic  parameters,  and  a  periodic  solution 
bifurcates  from  it.  In  the  Newtonian  case  (X  =  0)  this 
occurs  at  R/R^  24  when  Pr  =  10,  and  the  periodic  solution 

is  subcritical.  Chaotic  solutions  are  found  in  regions  of 
parameter  space  where  no  stable  solutions  exist,  as  shown  in 
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Figure  3,  In  the  viscoelastic  case,  the  steady 
supercritical  solution  loses  stability  at  a  value  of 

less  than  in  the  Newtonian  case;  in  fact  this  value 
decreases  towards  R/Rq  “  1  as  X(1  -  e)  (1  +  Pr"^)/c.  In 

addition  the  bifurcating  periodic  solution  begins  as 
supercritical  and  stable,  but  then  turns  around,  as  shown. 
Chaotic  solutions  have  been  computed  cfuite  close  to  the 
original  bifurcation  point  R  »  R^  for  the  appropriate  values 

of  the  elasticity  parameters. 

(ii)  When  X (1  -  e )  >  (1  +  Pr”^)/c  the  periodic  solution 
emerging  from  the  null  solution  is  supercritical  and  stable, 
and  remains  so;  it  does  not  lose  stability.  Moreover  in 
this  case  there  appears  to  be  no  chaotic  solutions  at  all, 
as  far  as  we  can  determine. 


FIGURE  3 

Schematic  bifurcation  diagram  for  equations  (20) .  Stable  steady 

solutions  are  denoted  by  - ,  vinstable  steady  solutions  by 

- —  ,  stable  periodic  solutions  by  ■  I  I  I  I-  ,  unstable  periodic 

solutions  by  -H*  -H-  -H-. 
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Thus  elas'tlcl'ty  acts  in  two  different  ways  in  the 
system  (20) :  it  first  advances  the  position  where  chaotic 
solutions  appear  and  then,  for  higher  values,  it  seems  to 
suppress  them  altogether.  This  phenomenon  warrants  further 
study. 


6.  SUMMARY.  It  is  evident  that  both  qualitative  and 
quantitative  changes  can  occur  in  stability  problems  for 
viscoelastic  flows  by  comparison  with  their  Newtonian 
counterparts,  but  that  the  results  are  often  strongly 
dependent  on  constitutive  relations.  This  fact  can  be  used 
as  a  partial  test  for  the  validity  of  a  constitutive 
hypothesis  when  coupled  with  appropriate  experiments. 

In  the  particular  case  of  the  thermal  convection 
problem  it  is  found  that  a  new,  periodic  Instability  of  the 
null  solution  may  appear  in  a  sufficiently  elastic  fluid, 
and  also  that  the  bifurcation  into  convection  may  be 
subcritical  for  certain  constitutive  relations.  The 
detailed  formulas  demonstrated  above  could  be  used  as 
measure  of  fluid  behavior  and  properties. 
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CRACK  SOLUTIONS  AND  HUCTTLE  FRACTURE  CRITERIA 


Dennis  n.  Tracf?y  an'^  Colin  E.  Freese 

Mechanics  of  MeteriaJs  Branch 
Army  Materials  and  Mechanics  Research  Center 
Watertown,  Massachusetts  R?.1 

ABSTRACT.  Numerical  solutions  for  a  group  of  elastic-p] astic 
crack  problems  are  discussed.  The  problems  consider  blunted 
flaws  in  infinite  plates  under  remote  monotonically  increasing 
tension.  The  solutions  were  obtained  using  a  combined  finite 
element-analytic  stress  function  formulation.  Results  related  to 
critical  stress  and  energy  release  rate  theories  of  ductile 
fracture  are  discussed. 

I.  INTRODUCTION.  The  results  presented  in  this  report 
supplement  other  results  presented  by  tbe  authors  on  the  topic  of 
elastic-plastic  stress  states  near  the  ends  of  blunted,  cracklike 
flaws,  Ref.n,2).  The  inclusion  of  crack  tip  geometry  in 
analytical  studies  is  a  complication  which  has  been  largely 
avoided  in  the  mechanics  of  fracture  literature.  Noteworthy 
exceptions  are  the  papers  of  Rice  and  Johnson(l)  and  MoMoekingM) 
which  have  considered  the  finite  deformation  effects  at  the  tip 
of  a  sharp  crack.  The  goal  of  our  work  has  been  to  determine  the 
important  aspects  of  the  elastic-plastic  crack  solution  related 
to  the  presence  of  the  traction  free  crack  tip  surface  so  that 
guidelines  to  the  applicability  of  sharp  crack  solutions  to 
ductile  fracture  problems  can  be  established. 
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Our  work  has  concentrate'^  on  the  character  o<^  the  near  tip 
elastic-plastic  stress  and  strain  fields  and  how  these  compare 
with  the  results  of  sharp  cracks.  Specific  models  treated  have 
been  flaws  with  semicircular  an^  circular  tips  (U  an'^  keyholel 
and  the  very  long  and  narrow  elliptical  flaw. 

In  Ref.  (2)  we  reported  the  intriguing  result  that  the  maximum 
stress  of  the  logarithmic  spiral  slipline  solution  is  not 
achieved  for  either  the  U  or  keyhole  tip  due  to  a  limited  range 
of  plasticity  along  the  tip  surface.  This  is  an  important 
finding  with  implications  related  to  critical  stress  theories  of 
fracture  such  as  discussed  hy  Ritchie,  Knott  and  Rice(R).  Here 
we  provide  results  indicatinq  the  range  of  app] i cabi 1 i ty  of  the 
fully  plastic  slipline  solution  and  the  character  of  the  elastic- 
plastic  stress  r^i  str  i  but  i  on  for  the  cracklike  ellipse. 

Another  important  aspect  of  the  solutions  which  relate  to 
energy  theories  of  fracture  is  the  variation  of  the  J  integral 
near  the  crack  tin,  Ref.(f).  J  is  a  path  integral  defined  in 
terms  of  the  strain  energy  density  '"'(elf  displacement  grar’ient 
and  stress  traction  along  an  arbitrary  path  L  which  starts  at  a 
point  on  the  lower  crack  flank  and  ends  at  a  point  on  the  upper 
crack  flank,  h'ith  the  crack  length  in  the  x  direction,  s 
arclength  increasing  counterclockwise,  n  outward  pointing  unit 
normal,  T  the  traction  vector  and  u  the  displacement  vector, 

J  =  J*  (Wn  -  T  •  3u/9x)ds 


J  was  introduced  hy  Rice (7)  who  demonstrated  that  for 
constitutive  theories  which  have  stress  derivable  from  w,  l  is 
path  independent  and  equal  to  the  rate  of  decrease  of  potential 
energy  with  respect  to  crack  length.  As  such,  J  can  be  expected 
to  characterize  the  severity  of  the  crack  tip  deformation  field. 
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while  incipient  crpck  extension  mi'jht  be  exper  imente]  ]  y 
correlated  and  predicted  in  application  in  terms  of  a  material 
dependent  critical  value  of  J. 

A  strain  ensrqy  density  function  does  not  exist  for  materials 
which  yield  and  flow  accordinq  to  the  Prandtl-Reuss  theory.  This 
is  the  theory  most  commonly  employed  in  studies  of  metal 
deformation  and  it  was  used  in  this  work.  However,  if  throuqhout 
the  domain  proportional  stressinq  results,  the  solution 
corresponds  to  the  nonlinear  elastic  solution  of  the  problem.  Of 
course  a  strain  energy  function  does  exist  for  nonlinear 
elasticity.  This  fosters  interest  in  evaluating  J  with  W  defined 
as  the  stress  working  densitv 


£ 

f  2- 

•^0 

The  interest  lies  in  establishing  the  degree  of  path  dependency 
of  J  for  the  elastic-plastic  case  and  determining  how  the  crack 
tip  value  differs  from  the  value  calculated  over  paths 

within  the  elastically  deformed  material  away  from  the  tip.  ^ 
computations  have  been  performed  for  the  U-tip  flaw  and  results 
are  discussed  below. 

II.  NUMERICAL  FORMULATION.  The  flaw  of  our  study  is  taken  to  be 
crackline  with  a  length  ?a  greatly  exceeding  the  root  radius  of 
curvature p  .  The  flaw  is  isolated  within  an  infinite  metallic 
sheet  which  is  under  plane  strain  constraint  with  a  tensile  load 
applied  far  from  the  flav/.  The  metal  deforms  accordinq  to  the 
Prandtl-Reuss  equations  which  have  linear  elastic  response  within 
the  Mises  yield  surface  and  plastic  flow  following  the  normality 
rule.  These  constitutive  relations  are  incremental  in  form  which 
means  that  the  solution  to  our  nonlinear  problem  must  be  obtained 
by  progressino  stepwise  along  the  load  path.  The  nonhardeninq 
model  is  used,  so  that  plastic  flow  occurs  under  a  constant  value 
of  equivalent  stress  equal  to  the  material's  uniaxial  yield 
stress  Y.  Besides  the  yield  stress,  the  only  other  material 
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properties  entering  the  analysis  are  Young's  modulus  K  and 
Poisson's  ratio  v  ,  and  the  latter  was  taken  equal  to  n.l.  For 
general  applicabi 1 i ty ,  the  results  are  presented  in  dimensionless 
form  in  terms  of  Y  and  E. 

The  analysis  was  conducted  using  a  numerical  formulation 
designed  for  this  problem  from  aspects  of  the  finite  element  and 
stress  function  boundary  collocation  methods.  Tn  a  region 
surrounding  the  flaw  root,  where  plastic  deformation  is 
anticipated,  finite  element  approximations  are  made  for  the 
displacement  field.  Over  the  remainder  of  the  infinite  domain, 
the  response  is  elastic  and  is  represented  by  an  analytic  stress 
function  power  series  approximation.  Boundary  collocation 
techniques  are  used  to  couple  the  equations  governing  in  the  two 
regions.  At  each  step  of  the  analysis,  discrete  unknowns  are 
nodal  displacement  increments  in  the  finite  element  region  and 
coefficients  of  the  pov/er  series  in  the  elastic  region. 

The  formulation  has  been  discussed  in  Ref.(l).  It  follows 
from  the  work  of  Bowie  and  Freese (B)  on  mapping-collocation 
techniques  using  elastic  stress  function  theory  and  the  work  of 
Tracey  and  Freese(9)  on  el ast i c-plast i c  finite  element  analysis. 
Its  appeal  lies  in  the  ease  by  which  infinite  regions  can  be 
accommodated  with  discretization  necessary  only  at  the  ends  of 
the  flaw.  Mapping  and  analytic  continuation  are  used  so  that  the 
traction  free  crack  boundary  condition  is  implicitly  satisfied. 
There  is  no  need  for  di scretization  and  collocation  along  the 
crack  surface  in  the  elastic  region. 

The  standard  mapping  function  which  transforms  the  ellipse 
with  semi-axes  a,b  to  the  unit  circle  |c|=l  in  a  ?  parameter 
plane  was  used  in  the  elastic  formulation.  The  U-tip  and  keyhole 
problems  were  treated  by  considering  the  flaws  as  slits  (b=91  in 
the  elastic  region.  The  finite  element  mesh  served  to  define  the 
crack  tip  shape  in  the  problems.  In  the  U-tip  problem,  within 
the  finite  element  region  the  slit  opens  to  a  parallel  faced. 
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semi-circular  enf’erl  slot  of  lenqth  ?Plp  ;  wbile  in  the  keyhole 
problem,  a  split  circular  boun'^ary  was  connecter’  to  the  en'is  of 
the  slit.  While  a  more  natural  U-tip  mo<iel  woul<^  have  a  uniform 
opening  along  the  entire  crack  length,  the  mapping  function  which 
woulrl  transform  this  flaw  onto  the  unit  circle  is  not  known. 
Nonetheless,  there  is  little  reason  to  expect  that  this  case 
would  have  a  solution  very  much  different  than  the  case 
considered. 

Whereas  the  general  planar  elasticity  problem  requires 
determination  of  two  analytic  stress  functions,  continuation 
reduces  the  problem  to  finding  a  single  function  which 

satisfies  the  remote  stress  condition  and  the  equilibrium  and 
compatibility  conditions  along  the  finite  element  interface. 

While  formally  there  is  an  interface  encircling  each  end  of  the 
flaw,  symmetry  allows  treatment  of  a  single  quadrant  of  the 
plane. 


The  interface  is  defined  in  the  auxiliary  plane  as  a  circle 
of  radius  R  centered  at  the  ends  of  the  slit,  so  that  the  problem 
in  the  elastic  region  is  to  find  <P  outside  the  disks  |^±  j  j< 
R.  The  interface  maps  onto  a  smooth  non-circular  contour  endinq 
on  the  faces  of  th“  ellipse  (slit).  For  the  elliptical  flaw  R 
was  chosen  to  have  the  interface  a  distance  l^*>p  from  the  tip  at 
70°  from  the  length  direction.  The  circular  tip  problem  had  R 
chosen  so  that  the  interface  was  at  a  distance  of  75p  at  70°. 


The  nature  of  formulation  is  such  that  the  comnutational  task 
increases  with  the  value  of  R  in  that  a  larger  region  must  he 
discretized  when  H  is  increased  while  a  higher  load  level  can 
then  be  accommodated.  The  maximum  load  that  can  he  incrementally 
reached  is  determined  in  the  course  of  the  analysis  according  to 
when  the  plastic  zone  exten'^s  to  the  interface. 
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The  power  series  enproxinet ion  ^or  <()  ( c )  thet  was  chosen 

consists  of  two  p^irts.  The  first  part  correapon^is  to  the  knov;n 

solution  for  an  elliptical  flaw  in  a  Purely  elastic  tension 

field.  The  second  part  serves  to  represent  the  perturbation  from 

this  elastic  solution  near  the  root.  In  terms  of  the  remote 

stress  increment  aT  ^nd  undetermined  coefficients  a  .  the 

n 

approximation  has  the  form 


<KC)  =  at  {(a  +  b)^  -  (3a  +  b)/c}/8  + 


n=l 


I)"" 


The  first  term  is  expected  ^o  adequately  represent  the  solution 

far  from  the  plastic  zone.  The  additional  series  is  expanded 

from  the  flaw  ends  ? =+1  and  its  terms  vanish  at  infinity, 

consistent  with  its  role  of  representinq  the  local  deviation  from 

the  exact  elastic  solution.  There  are  m  coefficients  a  ,  and 

n 

they  are  real  numbers  due  to  the  conditions  of  symmetry  across 
the  X  and  y  axes.  '^u^ficient  accuracy  was  tound  in  the  analyses 
reported  here  by  usinq  2n  terms  in  the  series  approximation. 

Reasonably  fine  element  qrids  were  used  in  the  analyses.  ^ 
mesh  consisting  of  approximately  l?hfl  nodes  and  triangular 

elements  was  used  in  the  elliptical  flaw  problem,  while  for  the 
other  problems  the  mesh  consisted  of  approximately  indd  nodes  and 
200n  elements.  Element  dimensions  in  each  case  increased  from 
the  tip  and  at  the  tip  edge  lengths  were  less  than  p  /10. 

The  usual  stiffness  aoproach  employing  the  principle  of 
virtual  work  was  used  to  assemble  the  governing  equations  in  the 
finite  element  region.  The  load  vector  in  this  system  is  made  up 
exclusively  of  force  increments  acting  on  interface  nodes.  These 
represent  the  load  transfer  across  the  interface.  A  partial 
Gaussian  elimination  provides  a  reduced  linear  system  of 
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eqviations  relatinn  these  interface  force  increments  to  the 

interface  no<ial  'displacement  increments.  The  force  increments 

are  expresse*^  in  terms  of  a  ^  using  the  elasticity  theory  stress 

equations  an<d  the  conventional  finite  element  consistent  loaf 

procedure.  Likewise,  the  'displacement  increments  are  expressed 

in  terms  of a  •  ^  series  with  fewer  terms  that  finite  element 

n 

interface  degrees  of  freedom  is  chosen  so  that  the  system  is 

over-determined  and  solution  is  by  the  method  of  least  squares. 

The  finite  element  nodal  displacement  increments  are  calculated 

once  a  are  compute'^  and  then  strain  and  stress  increments  are 
n 

computed  throughout  the  finite  element  mesh. 

Details  of  the  finite  element  formulation  have  been 
thoroughly  discussed  in  Ref. (9).  An  average  stiffness  approach 
is  employed  to  accommodate  plastic  zone  and  flow  rule  changes 
during  a  step.  A  nonlinear  problem  is  posed  at  each  step  since 
the  averages  are  defined  in  terms  of  the  undetermined  nodal 
displacement  changes  and  the  solution  is  found  by  iteration.  We 
have  labeled  it  an  "incremental  secant  stiffness"  formulation  to 
distinguish  it  from  tangent  stiffness  formulations  which  define 
equations  on  the  basis  of  the  current  state,  Corrective 
techniques  are  necessary  with  the  tangent  formulations  ho  force 
the  stress  solution  to  satisfy  the  yield  criterion  with  the 
result  that  significant  load  imbalance  errors  can  ensue.  The 
incremental  secant  stiffness  approach  on  the  other  band 
guarantees  that  the  yield  criterion  is  satisfie'^  at  the  end  of 
each  step,  while  load  imbalance  is  controlled  by  the  interation 
covergence  tolerance. 

An  adaptive  load  incrementation  algorithm  is  used  to  control 
the  load  path  discretization  error.  Load  step  size  is  treated  as 
a  variable  and  the  solution  is  determined  according  to  a 
constraint  selecte''’  to  limit  the  constitutive  law  changes  during 
a  step.  For  the  problems  described  below  the  constraint  limited 
the  maximum  deviatoric  stress  increment  modulus  to  In  the 

case  of  the  circular  tip  flaws,  5^  steps  were  taken  to  have  the 


265 


plastic  zone  extenr^  to  within  an  element  of  the  interface  an<^ 
this  corresponded  to  a  load  level  T=.2‘^Y.  The  elliptical  flaw 
problem  required  51  steps  to  a  load  T=.35y  for  the  plastic  zone 
ho  approach  the  interface. 

III.  ELASTIC  PLASTIC  SOLUTIQMS.  We  discussed  in  Ref.  (21  the 
finding  that  for  the  circular  tip  flaws  the  material  along  the 
crack  tip  surface  between  approximately  57®  and  9(1®  elastically 
unloaded  after  having  yielded.  It  appears  that  the  expansion  of 
the  plastic  zone  into  the  region  above  the  crack  flanks  creates  a 
load  shedding  mechanism  which  results  in' the  unloading.  Directly 
related  to  this  limited  crack  surface  yielding  behavior  is  the 
nature  of  the  stress  distribution  ahead  of  the  tip.  The  maximum 
stress  value  of  2.97Y  at  x=.2.Plp  predicted  from  slipline  theory 
under  the  assumption  of  fully  plastic  conditions  up  to  the  flanks 
is  not  realized.  Instead  it  was  found  that  a  stress  maximum 
equal  to  2.57Y  develops  at  x=2.75p.  The  logarithmic  spiral 
stress  distribution  holds  only  over  the  range  from  the  crack  tip 
(x='l)  to  x=2.1Rp. 

Figure(l)  illustrates  the  elastic-plastic  limitations  to  the 
fully  plastic  stress  solution  for  the  cicular  tip  flaws  and  also 
shown  are  the  recently  obtained  results  for  the  ellintical  flaw. 
The  stress  values  are  plotted  relative  to  the  yield  stress  and 
the  distance  scale  is  normalized  with  respect  to  the  root  radius. 
The  solid  curves  are  the  slipline  predictions  assuming  that 
yielding  spreads  along  the  crack  flanks.  The  maximum  stress  of 
the  elastic-plastic  solution  is  indicated  for  each  case  as  is  the 
point  at  which  the  fully  plastic  solution  ceases  to  apply. 

Compared  to  the  circular  tip  flaws,  the  elliptical  flaw  is 
seen  to  have  a  less  severe  stress  state,  with  a  maximum  stress 
equal  to  2.44Y  at  a  distance  of  4.99 p  ahead  of  the  flaw. 

Figure(2)  provides  the  complete  elastic-plastic  distribution  for 
the  ellipse  over  a  distance  10p  ahead  of  the  tip.  It  is  seen 
that  the  solution  departs  from  the  fully  plastic  curve  at  x=4.19p. 
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The  results  are  plotted  from  discrete  element  date  along  the 
x-axis  at  a  load  level  of  f’.SSY.  Actually,  the  distribution 
shows  little  change  over  the  distance  plotted  beyond  a  remote 
tension  value  of 

J  integral  computations  were  carried  out  for  the  U-tip  flaw 
problem  to  establish  the  degree  of  path  dependence  which 
follows  from  the  non-proportional  stressing  in  the  problem. 

Paths  L  were  chosen  to  be  piecewise  linear  contours  along  element 
edges  which  encircle  the  crack  tip,  as  illustrated  in 
Figures  ( 3 , 4 ) .  The  problem  of  data  retrieval  was  managed  by  a 
scheme  which  employed  element  face  numbers  defined  by  combining 
the  numbers  of  the  nodes  on  each  face  of  the  mesh.  Paths  were 
specified  by  listing  the  nodes  along  each  path  so  that  elements 
contributing  to  each  arc  of  the  path  were  identified  by  a  face 
number  test.  Stress,  strain  and  displacement  data  from  the  two 
constant  state  triangles  sharing  each  segment  were  averaged  and 
this  provided  the  data  needed  to  compute  the  edge  contributions 
to  J. 

Results  are  given  in  Figure(5l  corresponding  to  the  1^  oaths 
drawn  in  FigLire(4)  which  are  from  the  crack  tip  surface  (r=p)  to 
r=l‘1.9  p  .  The  results  are  plotted  with  J  normalized  by  a 

reference  value  1  ,  which  was  taken  to  be  the  value  for  a 

ret 

perfectly  sharp  crack  (slit)  with  length  ?a  and  remote  tensile 
stress  T, 

'^ef= 

The  numerical  error  of  the  finite  element  solution  and  the  metho'^ 
of  computation  of  J  can  be  gauged  tfom  the  path  dependency  found 
for  the  elastic  solution.  The  elastic  values  of  J  ranged  from 
l.on?  to  .995  with  the  maximum  at  r=1.5p  and  only  minor 

variations  on  paths  beyond  5  p,  we  might  conclude  then  that  the 
model  and  computational  scheme  allow  1  computations  which  are 
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approximately  .1%  in  error. 


It  is  seen  that  before  crack  tip  unloa^inq,  at  T=.l^y,  the 

value  of  J.  .  is  below  .7  c-  the  loa'^  level  T=,29)l,  the 

tip  ref 

tip  value  of  J  is  in'?;  below  the  reference  value.  The  plot 
illustrates  the  trend  which  has  significant  T  variations  over  an 
increasing  region  from  the  tip  as  load  level  increases. 

IV.  SUMMARY.  The  elast ic-plast i c  blunt  flaw  solutions  indicate 
that  the  plastic  zone  extends  over  a  limited  nortion  of  the  crack 
surface.  This  results  in  a  stress  distribution  ahead  of  the  flaw 
which  is  less  severe  than  fully  Plastic  slipline  analysis  would 
suggest.  The  analyses  v/ere  run  to  load  levels  hinh  enough  to 
achieve  the  asymptotic  stress  distributions  over  a  ICp  distance 
ahead  of  the  flaw,  '‘'hile  the  circular  tip  flaws  have  a  maximum 
stress  of  ?.57Y  at  2.7'^P  below  the  crack  surface,  the  elliptical 
flaw  has  a  maximum  stress  of  at  '1.9'’ p. 

J  integral  computations  for  the  U-tip  flaw  indicate  that 
there  is  significant  path  depdendency.  The  value  of  in 

comparison  to  the  remote  value  of  J  changes  with  load  level 
consistent  with  the  evolution  of  the  solution  to  the  fully 
developed  crack  tin  region  stress  state.  The  results  suggest 
that  fracture  criteria  based  on  J  should  account  for  differences 
in  the  tip  and  remote  values  of  J  when  representing  the  severity 
of  the  deformation  state  in  the  fracture  prone  crack  tip 
material . 
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Fig.1  Elastic-Plastic  Limitations  to  Fuly  Plastic  Stress  Solutions  for  Three  Crack  Models 


a 
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Fig.2  Stress  Variation  Near  EHptical  Flaw 


Rg.3  J  hvtegral  Contours  Atong  Common  Edges  of  Shaded  Triangular  Elements 
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Fig.4  Contours  Used  in  J  Integral  Computations 


THE  EAUSCHINGER  EFFECT  ON  STRESS  INTENSITY  FACTORS 
FOR  A  RADIALLY  CRACKED  GUN  TUBE 

S.  L.  Pu  and  P.  C.  T.  Chen 
U>S.  Army  Armament,  Munitions,  and  Chemical  Command 
Armament  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
Watervliet,  NY  12189-5000 


ABSTRACT.  The  theoretical  predicted  fatigue  life  of  a  high-strength 
steel  tube  which  has  undergone  an  autofrettage  procedure  is  significantly 
higher  than  the  experimental  prediction.  To  account  for  the  discrepancy, 
attention  is  now  turned  to  developing  better  elastic-plastic  models  for  a  high 
strength  steel.  An  improved  material  model  shows  that  reverse  yielding  may 
occur  in  the  inner  portion  of  the  tube.  This  reverse  yielding  reduces  the 
residual  compressive  hoop  stress  considerably  which  has  an  adverse  effect  on 
bore  crack  propagation.  This  study  considers  the  stress  intensity  factors  due 
to  a  radial  crack  taking  the  Bauschlnger  effect  into  consideration. 

The  elastic-plastic  interfaces  during  loading  and  unloading  in  the 
autofrettage  process  divide  the  tube  into  three— ring  regions.  The  residual 
stress  distribution  in  each  region  is  quite  different.  When  a  crack  grows 
from  one  region  into  another,  the  previous  method  using  functional  stress 
intensity  fails.  A  new  method  is  used  to  obtain  stress  intensity  factors  for 
a  radial  crack  growing  out  of  the  reverse  yielding  zone.  This  approach  is 
based  on  crack  face  weight  functions  obtained  by  Sha  using  stiffness 
derivative  finite  element  techniques  coupled  with  singular  crack-tip  elements. 


I .  INTRODUCTION .  An  early  brittle  failure  of  a  cannon  tube  during  the 
1960's  prompted  a  renewed  interest  in  fracture  analysis  of  cannon  pressure 
vessels  [1].  After  careful  investigation  of  the  cause  of  fracture,  some  basic 
design  changes  were  made  to  prevent  any  further  failures.  One  of  the  design 
changes  was  to  introduce  a  compressive  residual  stress  near  the  inner  radius 
of  the  cannon  by  an  autofrettage  process.  Considerable  efforts  have  been  made 
to  predict  the  residual  stress  distribution  in  an  overstrained  tube  based  on 
various  material  models  [2-5] .  In  order  to  confirm  the  theoretical 
predictions  of  residual  stresses,  experimental  methods  [6,7]  have  been 
initiated  in  our  laboratory. 

Since  the  current  cannon  tube  design  uses  a  wall  ratio  close  to  two,  the 
maximum  compressive  residual  stress  at  the  bore  for  a  fully  autofrettaged  tube 
is  about  85  percent  of  the  yield  stress  Oq  of  the  material,  tfost  of  the 
earlier  predictions  of  residual  stresses  were  based  on  the  assumption  of 
elastic  unloading.  According  to  Milligan  et  al  [8],  the  high  strength  gun 
steel  has  a  very  high  Bauschlnger  effect.  To  account  for  the  compressive 
yield  strength  reduction  due  to  the  Bauschlnger  effect,  Underwood  and  Kendall 
[9]  have  tried  to  estimate  the  residual  stress  distribution  and  its  effect  on 
fatigue  crack  growth  rate.  Recently,  Chen  obtained  a  closed  form  solution  of 
residual  stresses  in  autofrettaged  tubes  based  on  a  theoretical  material  model 
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taking  both  Bauschlnger  and  hardening  effects  into  consideration  [10] .  The 
new  residual  stress  distribution  is  very  different  from  that  obtained  earlier 
based  on  the  assumption  of  elastic  unloading.  The  functional  stress  intensity 
method  developed  for  the  computation  of  stress  intensity  factors  of  a  radial 
crack  initiating  from  the  inner  radius  of  a  tube  may  fail  due  to  the  presence 
of  a  reverse  yielding  region  near  the  bore.  The  objective  of  this  paper  is  to 
develop  a  numerical  method  to  overcome  such  a  difficulty  and  to  study  the 
Bauschlnger  effect  on  stress  intensity  factors  for  a  radially  cracked, 
partially  autofrettaged  gun  tube. 

II.  THE  BAUSCHINGBR  EFFECT  AND  RESIDUAL  STRESSES.  The  phenomenon  that  a 
material  lowers  its  elastic  limit  in  compression  (tension)  subsequent  to  a 
previous  stressing  in  tension  (compression)  beyond  the  elastic  limit  is  called 
the  Bauschlnger  effect.  A  quantity  representing  the  magnitude  of  the 
Bauschlnger  effect  is  the  Bauschlnger  effect  factor  (BEF).  This  is  defined  as 
the  ratio  of  the  yield  stress  upon  reverse  loading  to  the  initial  yield  stress 
(Oq).  The  BEF  (f)  is  a  function  of  percent  overstrain  ( e^) .  The  graph  of  BEF 
vs.  percent  overstrain  obtained  by  Milligan  [8]  for  a  modified  4330  steel 
having  a  martensitic  structure  is  shown  in  Figure  1  which  was  used  by  Chen  in 
his  computations  of  residual  stresses. 

Taking  the  Bauschlnger  effect  factor  f  into  consideration,  Chen  obtained 
the  closed  form  solution  of  residual  stresses  in  autofrettaged  tubes  [10] .  He 
assuned  a  material  model  which  exhibits  the  stress-strain  curve  shown  in 
Figure  2  during  tensile  loading  and  unloading  after  overstrain.  The 
assumption  of  elastic-perf ectly  plastic  loading  was  supported  by  the  fact  that 
very  little  strain-hardening  was  observed  in  the  tensile  test.  A  bilinear 
model  for  elastic-plastic  unloading  was  assumed  since  a  large  slope  of  strain- 
hardening  (m'E)  did  develop  after  the  occurrence  of  reverse  yielding. 

Referring  to  the  point  O'  in  Figure  2  as  the  origin  of  a  new  (primed) 
coordinate  system  (cj*,e*),  the  reverse  yielding  curve  can  be  expressed  as 

o' 

—  =  1  +  f  +  m'  ?/(l-m')  (1) 

^^o 

where  C  *  (E/o<j)e'^,  E  is  Young's  modulus,  is  the  plastic  strain  in  the 
primed  coordinates.  The  final  residual  stress  state  (denoted  by  a  double 
prime)  is  obtained  by  summing  the  stress  state  corresponding  to  the  elastic- 
perfectly  plastic  loading  and  the  primed  stress  state  corresponding  to 
unloading.  The  tangential  and  radial  components  can  be  written  as 

Oq"  =  00  +  00*  ,  Or"  »  Or  +  Op'  (2) 


Elastic  Plastic  Loading 

Let  a  and  b  be  the  inner  and  outer  radii  of  the  cylinder,  respectively, 
and  let  the  material  be  elastic-perf ectly  plastic,  obeying  the  Tresca's  yield 
criterion,  the  stress  components  are  given  by  [2] 

®r/  ®o  1  p  ^  p 

=  -  (+  1  +  )  ^  ^  a  <  r  <  p  (3) 

oe/oo  2  b2  r 
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(4) 


1  (ff  .  ff, 

Vo  O/ 


P  <  r  <  b 


Or/Oo 
00/00  2 

where  p  is  the  elastic-plastic  boundary  relating  to  the  internal  pressure  p  by 


p  1  P 

-  (1  -  --5)  +  iln  (-) 

Oo  2  b^  a 


The  equivalent  plastic  strain  can  be  calculated  by 

)2 

r2 

where 


"  eP  =  ei  (^  -  1)  »  a  <  r  S  p 


h - (1-''^) 

/3 


(5) 

(6) 

(7) 


In  Eq.  (7)  v  is  the  ratio  of  Poisson. 

Elastic-Plastic  Unloading 

If  the  internal  pressure  p  is  subsequently  removed  completely,  the 
unloading  may  be  either  elastic  or  elastic-plastic  depending  on  whether  the 
magnitude  of  p  is  less  than  or  greater  than  p^  which  is  the  minimum  pressure 
to  cause  the  reverse  yielding  to  occur.  If  p  <''-pm,  the  unloading  is  entirely 
elastic  and  the  stress  components  are 


00'  b2 


Using  Tresca' s  criterion  subject  to  o^”  >  O2"  >  00*’,  the  reverse  yielding  will 
not  occur  if 


b^ 

[±  - 11 


(8) 


-  00"  <  fOo 


(9) 


Substituting  Eqs.  (2),  (3),  and  (8)  into  (9),  we  obtain  the  expression  for  p^, 

,2 


Pm  1  a' 

- (l+f)(l  -  —J 


(10) 


If  p  >  Pm,  reverse  yielding  will  occur  in  a  region  a  <  r  <  p’  with  p'  <  p 
upon  unloading  and  we  have  from  Eq.  (1) 

0J."  -  00"  =  fOQ  +  m*Ee’P/(l-m' )  in  a  <  r  <  p*  (11) 
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The  stresses  in  the  plastic  and  elastic  regions  are  given  by  [3] 


^r’/Oo  “  P/^^o  -  -  e2'<l+f)(--)^(l  -  -^)  -  (l-e2')(l+f)An(-)  ,  (a«;r<p’)  (12) 

2  3.  r  3 


oq'/Oc,  =  Or'/Oo  -  (1+f)  '  ni'?7(l-ni’)  .  (a<r<p') 


(13) 


where 


<7rVcJo 

Oe’/do 


1  p*  „  p’  , 

-  (l+f)t±(— )^  -  (--)^] 

2  r  b 


(P’<r<b) 


3i'  =  (l-m' )/(m'+(l-m' )/ 3i] 
32*  =  m*  3i’/(l-m') 

V  =  3i*(l+f)(p'2/r2-l) 


(U) 

(15) 

(16) 

(17) 

(18) 


The  value  of  p’  can  be  found  from  continuity  of  at  p*  from  Eqs*  (12)  and 
(14),  Figure  3  shows  different  residual  hoop  stress  distributions  for  a 
hollow  cylinder  with  a  wall  ratio  of  two  and  E/a^^  =  200,  v  =  0*3,  p/a  =  1.6 
and  2.0.  The  broken  lines  are  elastic  unloading  solutions  (f=l),  while  solid 
lines  are  elastic-plastic  unloading  with  =  0  and  0-3.  It  shows  a  drastic 
difference  in  00”  in  the  reverse  yielding  region  near  the  bore  and  p'  varies 
slightly  with  p  and  la'  * 


111.  FUNCTIONAL  STRESS  INTENSITY  METHOD-  We  have  developed  the 
functional  stress  Intensity  method  [11]  for  the  computation  of  stress 
intensity  factors  of  radial  cracks  in  a  pressurized  and  partially 
autof rettaged  cylinder  by  combining  the  finite  element  method  and  the  weight 
function  method.  A  weight  function  vector  _h  is  a  universal  function  which 
depends  only  on  geometry  and  not  on  loadings  [12] .  For  a  given  radially 
cracked  ring,  if  the  mode  I  stress  intensity  factor,  and  the 

normal  component  of  crack  face  displacement  associated  with  a  symmetric  load 
systaa  1  are  known,  then  the  normal  component  of  the  crack  face  weight 
functions  can  be  expressed  by 


^ly(^  9 


H  3v(^)(x,il) 
2K3-(I) 


(19) 


where  H  =  E  for  plane  stress  and  H  -  E/(l-v^)  for  plane  strain,  x  -  (r-a)  is  a 
distance  measured  from  the  base  of  the  crack  along  the  crack  face  toward  the 
crack  tip,  and  I  is  the  crack  length.  For  any  symmetric  load  applied  to  the 
same  cracked  ting,  the  stress  intensity  factor  associated  with  the  new  load 
can  be  found  from 


H  £  3vCl) 

/  p(x)  dx  (20) 

K  (1)  0  U 
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where  p(x)  is  the  normal  stress  in  the  tangential  direction  at  the  crack  site 
due  to  the  new  load  applied  to  the  uncracked  ring*  If  the  uniform  tension  p^ 
at  the  outer  radius  is  taken  as  load  one,  we  can  find  numerical  values  of 
Kj(l)  and  by  the  finite  element  method.  We  wish  to  use  Eq.  (20)  to 

obtain  Kj  associated  with  the  autofrettage  residual  stresses.  It  can  be  seen 
from  Eqs.  (2),  (3),  (8),  and  (12)  through  (15)  that  the  residual  stress  Og" 
has  the  general  expression 

Og'Xr)  =  Ai  +  A2/r^  +  A3  Mr)  (21) 

for  a  tube  subject  to  an  elastic-plastic  loading  prior  to  an  elastic 
unloading,  where  Aj,  i  *  1*2,3  are  superposition  constants. 

Let  p(x)  in  Eq.  (20)  be  og"  above.  The  Integration  to  find  Kj  requires 
an  expression  for  3v(r)/3i.  A  method  which  assumes  v(^)  as  a  conic  section 
[13]  has  been  used  by  Grandt  [14].  Another  method  which  avoids  the  need  of 
3v(l)/3)l  has  been  developed  by  Pu.  He  uses  finite  elements  to  obtain 
functional  stress  intensities  Kj.(l),  K(.(r"2)^  and  Kj.(  Inr)  defined  by 

H  3v(l) 

•^cCp)  --77-  !  p  -jr-  ■1*  <22) 

k(1)  o  31 

for  a  crack  face  loading  p  «  1,  p  =  r~^,  and  p  =  Mr),  respectively.  Once 
the  functional  stress  Intensities  are  known,  then  the  stress  intensity  factor 
^l(p)  c®''  be  found  for  a  general  residual  stress  distribution  of  Eq.  (21)  by 
an  algebraic  equation 

Ki(p)  =  AiKc(1)  +  A2Kc(r-2)  +  A3Kc(  &ir)  (23) 

This  method  was  successfully  applied  to  various  residual  stresses  predicted 
from  various  material  models  [15], 

Now  for  a  radial  crack  in  a  reverse  yielding  zone,  the  stress  intensity 
factor  can  still  be  found  by  the  functional  stress  Intensity  method  since  the 
residual  stress  remains  the  same  form  as  Eq,  (21)  with  the  superposition 
constants  obtainable  from  Eqs-  (3)  and  (13),  However,  if  the  radial  crack  is 
longer  than  the  reverse  yielding  zone,  the  functional  stress  Intensity  method 
fails  since  the  crack  face  loading  p(x)  has  two  different  expressions  in  two 
different  regions:  0  <  x  <  p^-a  and  p’-a  <  x  <  Let  Py(x)  and  Pe(x)  be  the 
crack  face  loading  (residual  hoop  stress)  in  the  reverse  yielding  region  and 
the  elastic  region-  Equation  (20)  becomes 

Ki  =  2/^  PY(x)hiy(x,  i)dx  +  (24) 

To  use  this  equation,  we  have  to  find  the  explicit  crack  face  weight  function 
hjy  using  the  stiffness  derivative  finite  element  technique  explained  in  the 
next  section. 
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IV.  EXPLICIT  CRACK  FACE  WEIGHT  FUMCTIOH.  The  stiffness  derivative 
method  for  determining  linear  elastic  stress  Intensity  factors  was  introduced 
by  Parks  [16],  The  method  was  extended  to  calculate  the  weight  function 
vector  field  by  Parks  et  al  [17] .  An  efficient  finite  element  evaluation  of 
explicit  weight  functions  has  been  established  by  Sha  [18]  who  combines  the 
stiffness  derivative  technique  with  special  singular  crack  tip  elements.  He 
applied  the  method  to  radial  crack  problems  of  a  hollow  disk  [19]  .  The 
degenerated  quarter-point  quadratic  elements  were  used  around  a  crack  tip. 
These  singular  elements  were  surrounded  by  the  standard  eight-node 
quadrilateral  elements.  The  virtual  crack  extension  of  an  amount  was 

simulated  by  advancing  the  crack-tip  node  by  <5A  in  the  direction  collnear  with 
the  mode  I  radial  crack  (x-dlr action) .  The  surrounding  quarter-point  nodes 
were  also  shifted  to  new  locations  and  there  was  no  change  in  location  of  all 
other  nodes.  Hence  only  a  few  crack— tip  elements  have  experienced  changes  in 
elemental  stiffness  due  to  the  virtual  crack  extension.  This  makes  the 
stiffness  derivative  technique  very  efficient  from  a  computational  viewpoint. 

From  the  displacement  form  of  the  finite  element  method,  the  equation  of 
equilibrium  is 


[K] {u}  =  {F}  (25) 

where  [K]  is  the  global  sitffness  matrix,  {u}  and  {F}  are  displacement  vector 
and  load  vector,  respectively.  The  change  in  displacement  per  unit  crack 
extensions  can  be  found  by  differentiating  Eq.  (25)  with  respect  to  crack 
length  % 


d{u} 

dA 


,  d{F}  d[K] 

[K]-l[ - i-i  {u}] 

dA  dA 


(26) 


Note  that  d{F}/dA  =  0  if  we  select  a  load  system  which  consists  of  only 
surface  tractions  not  applied  on  the  crack  face.  The  global  stiffness 
derivative  d[K]/dA  is  the  sum  of  N^.  elemental  stiffness  derivatives 


d[K]  Nc  d[ki] 
d  A  d  A 


(27) 


where  N^,  is  the  number  of  crack  tip  elements  and  [kj]  is  the  element  stiffness 
of  the  ith  crack  tip  element.  For  a  small  crack  extension  6A,  the  element 
stiffness  derivative  may  be  approximated  by 


d[ki]  [l<^iU+6£  -  [l^il 
dA  6A 


(28) 


where  [kj]£^5jl^  and  [ki]£  are  element  stiffness  after  and  before  virtual  crack 
extension,  respectively. 

The  displacement  vector  is  a  function  of  position  (x,y)  and  the  crack 
length  A 
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{u)  =  {u(x,y,Z)} 


(29) 


Applying  the  chain  rule  of  differentiation  of  Eq.  (29)  with  respect  to  I 
gives 


3{u}  d{u}  3{u}  dx  3{u}  dy 

34  d£  3x  dA  3y  dA 


(30) 


For  a  mode  I  crack  lying  on  the  x-axis  with  colinear  virtual  crack  extension, 
we  have  dx/dl  =  1  and  dy/dZ  -  0.  Definitions  of  global  coordinates  and 
displacements  for  isoparametric  elements  are 


X  =  I  %(5,n)xi 

y  =  I  Ni(5,n)yi 


u  =  I  Ni(  C,h)ui 

V  =  I  Ni(5,Tl)Vi 


(31) 


where  5,n  are  local  coordinates;  x,y  are  global  coordinates;  xi.yi  are  global 
coordinates  of  node  i;  and  are  x  and  y  components  of  displacement  at 
node  1;  %(^,n)  are  shape  functions  which  interpolate  the  displacement  over 
the  element.  The  finite  element  evaluation  of  3{u}/3x  can  be  carried  out  from 


3{u}  1  3y  ^[Ni’^il  3y^ 

3x  det[J]  35  3ti  3n  35 


(32) 


where  [J]  is  the  Jacobian  matrix.  This  leads  to  the  expression  for  the  y 
component  of  mode  I  weight  function  vector  at  (x,y) 


hiy(x,y,l) 


H 

Ki(U(£) 


dv(l) 


1 

det[J] 


3[Niui]  3y 

{ - - 

35  3n 


3[NiUi]  3y 

- -}] 

35  95 


(33) 


For  a  hollow  disk  of  b/a  =  2,  Sha  and  Yang  [19]  have  obtained  explicit 
weight  function  for  a  single  bore  or  rim  radial  crack.  In  private 
communication,  Sha  has  provided  the  following  expression  to  approximate  the 
dimensionless  crack  face  weight  function  component 

Tg  4  Tg  ^/2-l 

2/1  hiy(— )  =  I  Dn(— )  (34) 

I  n=l  ^ 


where  rg,  the  distance  from  the  crack  tip  along  the  crack  face,  is  related  to 
X  and  r  by 


rg  -  il-x  =  Jl+a-r  (35) 

The  coefficients  0^^  n  -  1, 2,3,4,  determined  by  the  least  square  technique, 
are  given  in  the  following  table  for  various  crack  lengths- 
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TABLE  1.  LEAST  SQUARE  FITTED  COEFFICIENTS  OF  EQUATION  (34)  FOR  A  SINGLE  BORE 
CRACK  IN  A  HOLLOW  CYLINDER  OF  b/a  =  2  WITH  VARIOUS  CRACK  LENGTHS  c  = 
A/(b-a) 


1  b-a 

1 

Dl 

D2 

D3 

1 

1 

D4 

1 - i 

1  0.01 

0.7966D-K)0 

-0.7721D-02 

0.2935D+00 

0.3620D+00 

1  0.02 

0.7976D+00 

-0.6980D-02 

0.2840D-K)0 

0.3608D+00 

1  0,03 

0.8130D+00 

-0.7176D-01 

0.3711D+00 

0.3134D+00 

1  0.05 

0.8008D+00 

-0.5811D-02 

0.2649D+00 

0.35511>t00 

1  0.06 

0.8335D+00 

-0.1714D+00 

0.4942D+00 

0.2421D+00 

1  0.08 

0.8051D+00 

-0.2789D-01 

0.3116D+00 

0.3270D+00 

1  0.10 

0.8286D+00 

-0.1818D+00 

0.6119D+00 

0.15540+00 

1  0.20 

0.8132D+00 

-0.8254D-01 

0.5769D+00 

0.2414D+00 

1  0.30 

0.8026D+00 

0.1409D-02 

0.5881D+00 

0.3756D+00 

1  0.40 

0.7981D+00 

0.4661D-01 

0.6908D+00 

0,49350+00 

1  0.50 

0.7956D+00 

0.7236D-01 

0.9065D+00 

0.5437D+00 

1  0.60 

0.7990D+00 

0.4395D-01 

0.1361D+01 

0.4059D+00 

1  0.70 

0.8004D+00 

0.2255D'01 

0.2130D+01 

-0.3915D-01 

1  0.80 

J _ 

0.7976D+00 

0.4388D-01 

L  -  -  ™ 

0.3537D+01 

-0.1131D+01 

V.  STRESS  INTENSITY  FACTORS.  To  use  Eqs.  (34)  and  (24),  we  need  a 
certain  transformation  of  variables.  Let  us  use  a,  the  bore  radius,  to 
normalize  all  linear  lengths  and  use  the  same  notations  (before  normalization) 
to  denote  the  normalized  lengths  except  that  a  is  unit  and  b  is  w,  the  wall 
ratio.  Denote  by  t,  Eq.  (24)  associated  with  the  residual  hoop  stress 

becomes 


00"  4 

- )e(  I  DnW2-l)dT 


®o  n=l 


1  06"  ^  1  0  0"  09"  ^  , 

“  /  ( - )e  (  I  DnTn/2-l)dT+  /  [( - )  -  ( - )e][  I  DnW2-l]dT  (36) 

o  Oq  T  Oq  Y  Oq 

where  residual  hoop  stress  in  the  elastic  unloading  region, 

while  (cf0'*/tJo)Y  that  in  the  reverse  yielding  region  and  x’  is  the  value  of 

T  corresponding  to  r  -  p',  the  elastic-plastic  interface  during  unloading. 

The  residual  hoop  stress  can  be  represented  by  the  general  form 

00 

(  )e  =  Ale  +  A2e  r-2(x)  +  A3e  Jla(r/T)  (37) 

<^o 

•"e" 

( - )y  =  Aiy  +  A2y  r“^(T)  +  A3Y  Aii(r/  x)  (38) 

°o 
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where  the  coefficients  Aig,...,A3Y,  can  be  obtained  from  Eqs.  (2),  (3),  and 
(12)  through  (15).  From  Eq.  (35)  and  the  definition  of  t,  we  have 

r(T)  =  £(Vt)  .  ^o  =  (1+^)/^  (39) 

Denoting  Ij^g(a,g),  l2n(“»®)»  ^3n(®*^)  following  definite  integrals, 

g 

^ln(«.P)  =  /  W2-ldT 


g 

l2n(«.e)  =  /  r-2(T)Tn/2-l<iT  n  =  1,2,3,4  (40) 

a 


l3n(«.3)  =  in(r(T))W2-ldT 
a 


Eq.  (36)  can  be  written  in  the  form 

Ki  1  3  4  3  4 

- [  I  Aie  1  Dnlin(0»l)  +  I  (AiyAie)  I  Dnlin(^’.l)  W 

/it  i=»l  n-1  i=l  n=l 


The  expressions  of  integrals  a, 3) ,  i  -  1^2 ^3^  n  -  are  easy  to 

carry  out  and  are  omitted*  Equation  (41)  can  be  used  for  any  loading  which 
yields  a  hoop  stress  of  the  form  given  by  Eq.  (21)  and  for  any  multiply- 
cracked  cylinder  as  long  as  the  coefficients  are  known  for  that  particular 
cracked  geometry.  In  case  there  is  no  reverse  yielding  region,  Eq.  (41) 
reduces  to 


Ki  1  3  4 

ZZ,  ~  "p.  ^  ^ie  I  ^n^ln(^»^)  (^2) 

o^Viil  /tt  i=l  n-1 

For  a  shallow  crack  which  lies  entirely  in  the  reverse  yielding  region,  Eq. 
(41)  becomes 

Ki  I  3  4 

_  =  I  AiY  I  Dglig(0,l)  (43) 

Oq/WI  /tt  i=l  n=l 

Table  1  gives  Dj^  for  single  bore  crack  of  various  discrete  crack  lengths.  For 
a  crack  length  not  given  In  Table  1,  Sha  [19]  suggested  the  use  of  cubic 
spline  interpolation  to  obtain  the  weight  function  for  that  length  from 
discrete  values  in  Table  1  followed  by  the  least  square  technique  to  calculate 

D*  s. 

Equation  (42)  is  used  to  compute  Ki/pQ/nf  for  a  single  bore  crack  in  a 
hollow  cylinder  of  w  =  2  subject  to  uniform  tension  p^  at  the  outside 
cylindrical  surface.  The  Lame"'  solution  gives  Ai  =  A2  ®  wV(w^l)  and  A3  ®  0. 
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Numerical  results  for  various  crack  lengths  are  very  accurate  in  comparison 
with  results  previously  reported  in  [20]  .  Further  check  of  ntjmierical 
computations  of  from  Eq*  (42)  is  done  for  a  partially  autof rettaged 

cylinder  without  reverse  yielding*  The  results  confirm  those  published  in 

[21]- 

For  a  cylinder  of  w  =  2,  the  Bauschinger  effect  factors  for  60  percent, 

80  percent,  and  100  percent  overstrain  are,  from  Figure  1,  f  -  0.44,  0-42,  and 
0.38,  respectively.  Using  0-3,  E/ 0^  =  200,  the  elastic-plastic  interface 
during  unloading  depends  on  f  and  •  It  varies  from  p*  =  1.106  for  e  = 

0.6  (f  -  0,44)  and  m*  =  0.3  to  p*  1.20  for  e  -  1  (f  -  0.38)  and  -  0.  The 
residual  hoop  stress  distributions  near  the  bore  for  these  two  cases  are  shown 
in  solid  lines  in  Figure  3  while  broken  lines  are  corresponding  stresses 
without  taking  Bauschinger  effect  into  consideration.  The  residual  stress  in 
the  reverse  yielding  region  varies  drastically  with  f  and  m' ,  Stress 
Intensity  factors  due  to  residual  stresses  are  calculated  from  Eq-  (43)  or 
(41)  depending  on  whether  the  crack  tip  is  inside  or  outside  of  the  reverse 
yielding  zone.  Figures  4  and  5  are  graphs  of  Kj/(  0^/71^)  as  a  function  of 
dimensionless  crack  length  c  =  i/(w-l)  for  e  -  0.6  and  e  -  1,  respectively. 
Superposing  stress  intensity  factors  due  to  an  internal  pressure  p  =  a^/Lf, 
the  combined  stress  intensity  factors  from  both  p  and  residual  stresses  are 
shown  in  Figures  6  and  7  for  Lf  =  3  and  for  e  0.6  and  e  *  1.0,  respectively* 
Figure  8  Is  a  similar  graph  for  Lf  =  1,5  and  e  =  1.0.  In  Figures  6,  7,  and  8, 
the  curves  in  the  centerline  correspond  to  m’  =0,  the  curves  in  the  broken 
line  are  for  m’  =0.3,  while  the  solid  lines  are  results  obtained  earlier 
without  taking  reverse  yielding  into  consideration.  For  c  =  0.01,  Figure  7, 
values  of  Kj/ ( aQ/iril)  are  0.55  and  0.37  for  m’  =  0  and  0*3,  respectively, 
versus  the  corresponding  value  0.058  from  the  solid  line.  It  indicates  that 
the  Bauschinger  effect  will  greatly  reduce  the  advantageous  effect  of 
compressive  residual  hoop  stress  introduced  by  the  autofret tage  process* 

VI.  CQNCLLfSION.  The  Bauschinger  effect  of  the  high  strength  gun  steel 
causes  the  presence  of  the  reverse  yielding  in  a  partially  autof rettaged 
cannon  tube  of  wall  ratio  of  two*  This  reverse  yielding  reduces  the  magnitude 
of  compressive  hoop  stress  considerably  in  the  reverse  yielding  region-  This 
stress  reduction  will  result  in  much  higher  stress  intensity  factors  in  a 
shallow  bore  crack  In  a  pressurized  and  autof rettaged  gun  tube*  The  higher 
the  stress  intensity  factor  Implies  the  lower  the  fatigue  life. 

The  expression  in  Eq-  (34)  for  explicit  crack  face  weight  function 
obtained  by  a  combination  of  stiffness  derivative  technique  and  special 
singular  crack  tip  elements  is  highly  accurate.  It  recovers  the  previous 
stress  intensity  factor  results  for  a  single  bore  crack  in  a  tube  subject  to 
various  loading  conditions*  The  method  can  treat  any  crack  face  loading 
including  stress  discontinuities  and  stress  gradient  discontinuities  over  the 
crack  face* 

Numerical  results  are  limited  to  single  bore  radial  cracks  in  this  paper* 
However,  the  method  is  general  for  either  bore  cracks  or  rim  cracks  for  any 
number  of  cracks  as  long  as  we  have  obtained  the  explicit  crack  face  weight 
function  for  that  particular  cracked  geometry- 
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Stress-strain  curves  during  loading  and  unloading 
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FIGURE  3 

Dependence  of  on  f  and  m'  in  the  reverse  yielding  region  of 

a  cylinder  of  wall  ratio  of  two 
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FIGURE  4 

Dimensionless  stress  intensity  factors  as  a  function  of  dimensionless  crack  length 
c  due  to  compressive  hoop  stress  in  a  cylinder  having  60  percend  degree  of  autofret 

tage  for  various  values  of  f  and  m' 


FIGURE  5 


Dimensionless  stress  intensity  factors  as  a  function  of  dimensionless  crack  length 
c  due  to  compressive  hoop  stress  in  a  cylinder  having  100  percent  degree  of  autofret 

tage  for  various  values  of  f  and  m' 
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FIGURE  6 

Resultant  stress  intensity  factors  as  a  function  of  crack  length  c  for  various 
values  of  f  and  m'  in  a  pressurized,  autofrettaged  cylinder.  Internal  pressure 
p  =  ct^/3  and  degree  of  autofrettage  e  =  0.6 
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FIGURE  7 

Resultant  stress  intensity  factors  as  a  function  of  crack  length  c  for  various 
values  of  f  and  m'  in  a  pressurized,  autofrettaged  cylinder.  Internal  pressure 
p  =  a^/3  and  degree  of  autofrettage  £=1.0 
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Resultant  stress  intensity  factors  as  a  function  of  crack  length  c  for  various 
values  of  f  and  m'  in  a  pressurized,  autofrettaged  cylinder.  Internal  pressure 
D  =  a  /1. 5  and  decree  of  autofrettage  e  =  l  ,o 
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ELASTIC-PLASTIC  LOADING  AND  UNLOADING  IN  A  THICK  TUBE 
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ABSTRACT*  Using  Tresca’s  yield  criterion,  its  associated  flow  rule,  and 
the  linear  hardening  law,  analytical  solutions  are  obtained  for  elastic- 
plastic  loading  and  unloading  problems  in  a  thick  tube  subjected  to  uniform 
internal  pressure.  Explicit  expressions  for  the  displacement,  strains,  and 
stresses  are  presented  and  numerical  results  in  a  closed-end  tube  are 
calculated* 

I.  INTRODUCTION*  The  importance  of  the  determination  of  residual 
stresses  in  a  prestressed  thick-walled  cylinder  is  well-known  and  elastic- 
plastic  loading  response  has  been  extensively  studied  [1-5] *  Most  of  the 
earlier  solutions  for  residual  stresses  were  based  on  the  assumption  of 
elastic  unloading  and  only  a  few  considered  elastic-plastic  unloading  [2,5]. 
Bland’s  work  [2],  which  neglects  the  Bauschinger  effect,  is  based  on  the 
Tresca's  yield  condition  and  isotropic  hardening  rule-  Kinematic  hardening  is 
the  simplest  theory  that  can  model  the  Bauschinger  effect  [6,7]*  If  unloading 
does  not  occur,  there  is  no  difference  between  the  kinematic  and  isotropic 
hardening  models-  For  unloading  with  reverse  yielding,  the  results  based  on 
these  two  models  will  be  different  as  shown  in  a  recent  paper  [5]  using  the 
ADINA  finite  element  code  [8] -  The  von  Mises’  yield  condition  and  its 
associated  flow  rules  were  used  in  both  models- 

In  this  paper  a  closed-form  solution  for  elastic-plastic  loading  and 
unloading  in  pressurized  thick-walled  cylinders  is  presented  using  Tresca’s 
yield  criterion.  Its  associated  flow  rule,  and  the  linear  kinematic  hardening 
law-  Numerical  results  are  calculated  for  a  closed-end  tube. 

II-  ELASTIC-PLASTIC  LOADING*  Consider  a  thick-walled  cylinder,  internal 
radius  a  and  external  radius  b,  which  is  subjected  to  Internal  pressure  p* 

The  material  Is  assumed  to  be  elastic-plastic,  obeying  the  Tresca’s  yield 
criterion,  the  associated  flow  theory,  and  a  linear  strain-hardening  rule- 
Using  the  isotropic  hardening  theory,  the  elastic-plastic  solution  has  been 
obtained  by  Bland  [2].  In  order  to  consider  the  Bauschinger  effect,  the 
kinematic  hardening  theory  is  used  here*  Subject  to  the  condition  O0  >  > 

Tresca’s  yield  criterion  for  the  Prager-hardening  rule  [6]  states  that 
yielding  occurs  when 


where 


(00-00)  -  (aj.-Oj.)  =  Kq 


P 


00  =  ceq 


P 


Oj.  =  ce^ 


(1) 

(2) 
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define  the  position  of  the  center  of  the  yield  surface,  c  is  a  material 
constant  and  Kq  the  Initial  yield  stress.  The  associated  flow  rule  states 
that 


de0  =  -dEp  and  de^  =0 

For  the  case  of  linear  strain-hardening,  the  yield  stress  curve  can  be 
represented  by  a  straight  line, 

K/Kq  =  1  +  ne^  and  n  “  (E/Ko)m/(l-m) 

p 

where  n  is  a  material  constant  and  the  equivalent  plastic  strain  e  is 
by 

^  ~  ms  [  ((dee") 2  +  .  --  e/ 

/3 

The  elastic-plastic  solution  for  the  stresses  and  displacements  can  be 
obtained  explicitly.  The  expressions  in  the  plastic  range  (a  <  r  <  p) 

®r/^  I  I  P 

=  -  +  Ti>  +  r  (-2  -  1)  -  (1  -  h0)log  - 

Oe/Ko  2  b^  2  r2  r 


Oz/Eq  “  vp^/b^  -  2v(l-n3)  log  -  +  Ee_/Ko 

r 

(E/Ko)(u/r)  =  (l-2v)(l+v)(Oi./Ko)  -  VEEz/Kq  +  (l-v2)p2/r2 
and  in  the  elastic  range  (p  <  r  <  b) 

cJr/Ko 

<Je/Ko 

o^/Ko  =  Ee^/Ko  +  vp^b^ 


1  -  p\ 

2  ^b2  ^ 


(E/KQ)u/r  =  -  (l+v)[p2/r^+  (l-2v)p2/b2]  - 


where 


and 


,  ( ii-2  v) 

M  =  0  (open-end)  ,  1  (closed-end) 


✓I 

3-1  =  n  +  --  (E/Ko)/(1-u2) 


(3) 

(4) 
defined 

(5) 

are 

(6) 

(7) 

(8) 
(9) 

(10) 

(11) 

(12) 

(13) 

(14) 


(15) 
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The  yield  surface  moves  in  translation  during  plastic  deformation  as  given  by 

00  =  -n^  =  (v^/2)ce^  and  =  SCp^/rM)  (16) 

The  elastic  plastic  surface  p  is  related  to  the  Internal  pressure  p  by 

p/Ko  =  J  (l-p2/b2)  +  (l-nB)log  p/a  +  ^  (p2/a2-l)  (17) 

III.  REVERSE  YIELDING.  If  the  pressure  p  given  by  Eq.  (17)  is 
subsequently  removed  completely  with  no  reverse  yielding,  the  unloading  is 
entirely  elastic  and  the  solution  is  given  by 


Ur 

00’ 


P  ^ 

=  -9-/7 - [-  '2  ■ 

-  1  r^ 


Oz'  =  v(ar'+a0')  +  Ee^’ 

Ee^'  =  -(ii-2v)p/(b2/a2-i) 

EuVr  “  -[(1-v  -  pv)  +  (l+v)b^/r ^]p/(b^7a^-l) 


(18) 

(19) 

(20) 
(21) 
(22) 


The  residual  stress  system,  which  will  be  denoted  by  two  primes,  is  the  sum  of 
the  system  produced  by  loading  and  that  produced  by  unloading,  l*e.,  -  Oj. 

+  ,  etc-  Assuming  the  kinematic  hardening  rule  and  using  Tresca’s 

criterion  subject  to  >  Og”,  the  reverse  yielding  will  not  occur  if 

(a^--(^)  -  (00-00)  <  Kq  (23) 


Substituting  the  loading  and  unloading  solution  into  Eq,  (23),  we  can 
determine  the  minimum  pressure  (p^)  reverse  yielding  to  occur-  The 

equation  for  is  given  by 

Pm/^o  °  (1-a^/b^)  (24) 

Equating  (17)  to  (24),  we  can  determine  the  maximum  amount  o£  overstrain  for 
reverse  yielding  not  to  occur. 


IV,  ELASTIC-PLASTIC  UNLOADING-  Now  suppose  that  the  loading  has  been 
such  that  the  Internal  pressure  Is  larger  than  given  by  Eq-  (24).  On 

unloading,  yielding  will  occur  for  a  <  r  <  p’  with  p'  <  p-  Using  the 
kinematic  hardening  rule  during  unloading  and  assuming  o^"  >  >  00*’,  we 

have 


(ar"-ar")  -  (a9"-ci0")  =  Ko  (25) 

where 

,  ae"  =  cee"'’  (26) 

Since  the  residual  stress  system  is  the  sum  of  two  systems  produced  by  loading 
and  unloading,  combining  Eqs-  (1)  and  (25)  leads  to 
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(Op'-Oir*)  -  (Oe'-ae')  =  2Kq 


(27) 


where 

cxj.'  =  ,  oq’  =  ce0’^  (28) 

During  elastic-plastic  unloading,  the  associated  flow  theory  states  that 

de0’‘  =  -dGr'^  **  ®  =  0  (29) 

p 

It  has  been  assumed  that  the  sign  of  de©'  is  the  same  throughout  the 
unloading  process  and  that  is  negative.  This  will  be  the  case  when  the 
internal  pressure  is  removed  during  unloading.  Since  -  tz'®  - 

known,  we  can  use  Hooke's  law  and  the  equilibrium  equation, 

do^'/dr  =  (<7e'"®r')/^ 

to  express  o^'  in  terms  of  o^.' 

O2'  =  E^z’  +  2vaj.'  +  vr(daj.'/dr)  (31) 

Since  the  dilation  is  purely  elastic 

(du’/dr)  +  u’/r  +  Ez’  =  E-l(l-2v)(ar'+a9’+Oz’)  (32) 

On  integration  using  Eqs.  (30)  and  (31),  we  obtain 

ru'  =  (l-2v)(l+v)E" Op'  -  ve^'r^  +  A.  .  (33) 

where  A  is  a  constant-  The  strain  components  can  be  expressed  In  terms  of  o^.' 
and  (do^'/dr).  The  plastic  strain  components  are 

£3'^  =  =  Ar"^  -  (I- v^)E~  ^r(dOj.'/dr)  (3A) 

Using  Eq.  (31)  together  with  Eqs.  (27)  and  (28),  we  have 

r(dar'/dr)  =  -2(Ko-cee'^)  (35) 

Substituting  Eq.  (35)  into  Eq.  (34)  and  determining  the  constant  A  by  the 
condition  e0'P  =  0  at  p',  we  obtain 

A  =  -2(1-v2)(Kq/E)p'2  (36) 


and 

(E/Ko)e0'^  =  -2(p'2/r2-l)/[2c/E  +  (l-v^)-!]  (37) 

Equations  (35)  and  (36)  with  the  boundary  condition  at  r  =  a  suffice  to 
determine  Oj-'  in  the  plastic  region-  The  expressions  for  the  stresses  in  (a  ^ 
r  <  p')  are  given  explicitly  by 

o^'/Kq  »  p/Kq  -  n8(p' Va^-p' ^/r^)  -  2(l-n8)log(r/a)  (38) 
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Oe’/Ko  =  Or'/Ko  -  2  -  2nB( p*  ^/r 2-1) 


(39) 


Oz’/Kq  *  v(Or'+O0')/Ko  -  Ee^/Ko 

(40) 

and  in  (p'  <  r  <  b)  given  by 

a^’/K„ 

(41) 

oeVKo 

(42) 

a^'/Ko  =  -2vp'2/b2  -  ECz/Ko 

(43) 

The  continuity  condition  of  determines  the  relation  between  p*  and  p  as 
given  by 

P  * 

1  -  p'2/b2  +  2(l-n3)  log  --  +  n8(p'2/a2-i) 

a 

=  -  (l-p2/b2)  +  (l-n3)Xog(p/a)  +  -  n3(pVa^l)  (44) 

2  2 

The  yield  surface  moves  in  translation  during  elastic-plastic  unloading 
according  to 

“0*  ““r'  c^e'^  (45) 

p 

where  Cq'  Is  given  by  Eq.  (37). 

V.  NUMERICAL  RESULTS  AND  DISCUSSIONS.  Consider  a  closed-end  thick- 
walled  cylinder  with  the  following  parameters:  b/a  =3,  v  =  0.3,  and  E/K^j  = 
200.  The  numerical  results  for  the  displacements,  strains,  and  stresses 
during  elastic-plastic  loading  and  unloading  are  calculated.  Figure  1  shows 
the  relationship  between  the  internal  pressure  factor  (p/K^)  and  the 
dimensionless  elastic-plastic  interface  (p/a)  for  various  values  of  the 
hardening  parameter,  m  =  0,  0.05,  0.1,  and  0.2.  The  displacements  at  the 
inside  and  outside  boundaries  of  the  tube  (U^  and  Ujj)  are  shown  in  Figure  2  as 
functions  of  the  elastic-plastic  interface  for  m  =  0.1.  The  solid  and  dotted 
curves  represent  the  displacements  during  loading  and  after  unloading, 
respectively.  Figure  3  shows  the  distribution  of  hoop  stresses  (oq)  during 
loading  for  p/a  =  1.0,  1.5,  2.0,  2.5,  and  3.0.  After  complete  unloading  from 
different  stages  of  loading,  the  corresponding  residual  hoop  stresses  (oq”) 
are  shown  in  Figure  4.  Reverse  yielding  occurs  in  a  strain-hardening  tube 
with  m  “  0.1  only  when  the  plastic  portion  (p)  is  larger  than  1.652  a.  In 
order  to  show  the  effect  of  hardening  parameters  (m)  on  the  residual  stress 
distribution,  the  numerical  results  are  presented  in  Figure  5  for  m  “  0,  0.05, 
and  0.10.  As  can  be  seen  in  the  figure,  larger  values  of  hardening  parameter 
(m)  tend  to  reduce  the  beneficial  residual  hoop  stress  at  the  bore. 

All  the  results  presented  in  Figures  1  through  5  are  based  on  the 
kinematic  theory.  Hie  have  also  calculated  the  results  based  on  the  isotropic 
hardening  theory  [2] .  Figure  6  shows  a  comparison  of  two  hardening  rules  for 
the  residual  hoop  stresses  in  a  closed-end  thick-walled  cylinder.  The  dotted 
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curves  represent  isotropic  hardening  model  with  no  Bauschinger  effect. 
According  to  this  model,  there  Is  no  reverse  yielding.  The  solid  curves  show 
the  Bauschinger  effect  and  reverse  yielding  occurs  in  both  cases,  p'/a  = 

1.098,  1.336,  for  p/a  =  2.0,  3.0,  respectively.  The  residual  hoop  stresses  at 
the  bore  are  Oq'VKq  =  —0.729,  -0.327  for  p/a  =  2.0,  3.0,  respectively. 
According  to  isotropic  hardening  rule,  those  values  of  should  be 

-1.060,  -1.318  for  p/a  =  2.0,  3.0,  respectively.  These  numerical  results 
indicate  that  the  effect  of  hardening  rules  on  the  residual  hoop  stresses  is 
quite  significant,  especially  near  the  bore.  Many  plasticity  theories  for 
reverse  yielding  have  been  proposed  and  reviewed  [9] ,  and  many  computer 
programs  have  been  developed  [10] .  It  is  believed  that  the  numerical  results 
based  on  other  theories  will  fall  within  the  limits  obtained  by  using  the 
kinematic  and  isotropic  hardening  models. 
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Figure  1.  Effect  of  hardening  on  the  relation  between 
Internal  pressure  and  elastic-plastic 
Interface. 
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ABSTRACT 

Representation  theorems  for  tensor  functions  together  with 
the  uniaxial  formulation  of  the  theory  of  viscoplasticity  based  on 
overstress  are  used  to  present  a  fully  invariant  three-dimensional, 
small-strain,  nonlinear  orthotropic  theory*  To  facilitate  identifi¬ 
cation  of  the  material  functions,  ’’elastic  anisotropy”  and  ’’inelastic 
anisotropy”  are  related  by  a  scalar  function. 

In  each  preferred  direction  the  stress  is  composed  of  time 
(rate) -independent  (or  plastic)  and  viscous  (or  rate-dependent)  contri¬ 
butions,  It  is  demonstrated  that  these  contributions  can  vary  in  the 
different  preferred  directions.  It  is  possible  to  have  almost  linear 
elastic  response  in  one  direction  while  the  other  two  directions  exhibit 
inelastic  deformation.  While  normally  the  stress-strain  curve  with  the 
highest  elastic  modulus  will  also  exhibit  the  highest  stress  in  the 
plastic  range,  this  tendency  can  be  reversed  by  a  judicious  choice  of 
the  invariants. 

The  above  properties  are  demonstrated  by  numerical  experiments 
which  include  monotonic  and  cyclic  loading  at  constant  strain  rate  as 
well  as  relaxation  tests. 
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1.  INTRODUCTION 


Recently  the  modeling  of  inelastic  behavior  of  metals  has  been 
changing  from  predominantly  yield  surface  oriented  approaches  to  others, 
which  use  different  concepts  such  as  the  endochronic  time  [1]  for  the  time- 
independent  case.  For  rate-dependence,  the  unified*  theories  [2-5]  were 
developed  which,  like  the  endochronic  theory,  do  not  use  the  formalism 
associated  with  yield  surface  plasticity.  These  theories  are  proposed  for 
the  uniaxial  case  and  for  isotropy  when  three-dimensional  formulations  are 
considered . 

The  viscoplasticity  theory  based  on  overstress  is  of  the  unified 
type  and  considers  rate-dependence  fundamental  [6,7],  It  was  previously 
developed  as  a  deformation  theory  [8]  and  is  now  extended  to  cyclic  neutral 
behavior  [6,7],  As  in  the  case  of  the  unified  theories,  only  an  isotropic 
formulation  has  been  proposed  so  far* 

When  considering  anisotropy  in  the  inelastic  deformation  of  metals, 
it  is  necessary  to  distinguish  between  deformation  induced  anisotropy  and  nat¬ 
ural  anisotropy.  For  the  former,  an  initially  isotropic  material  may  become 
permanently  anisotropic  after  inelastic (plastic)  deformation.  The  latter  is 
essentially  present  in  single  crystals,  metal  matrix  composites  and  direc¬ 
tionally  solidified  alloys.  Frequently  the  anisotropy  induced  by  a  kinematic 
variable  (backstress)  is  also  considered  separately. 

Within  the  context  of  the  theory  of  viscoplasticity  based  on 
overstress,  an  orthotropic  formulation  is  introduced  in  this  paper.  The  aim 
of  the  formulation  is  not  to  provide  the  most  general,  tensorially  invariant 
representation.  Rather,  a  simplified  version  is  developed  which  captures 
key  anisotropic  phenomena  and  renders  the  identification  of  real  material 

*The  term  "unified”  is  used  to  indicate  that  creep  and  plastic  strains  are  not 
considered  separately;  all  inelastic  deformation  is  rate  dependent  and  is 
modeled  through  the  inelastic  strain  rate. 
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properties  through  testing  as  simple  as  possible.  To  this  end  it  is  assumed 
that  elastic  anisotropy  and  inelastic  anisotropy  are  simply  related.  The 
deformation  induced  anisotropy  is  not  included  but  anisotropy  due  to  the 
development  of  the  kinematic  variable  (backstress,  equilibrium  stress)  is 
part  of  the  formulation. 

In  the  present  analysis,  tensor  function  representations  and  the 
uniaxial  formulation  of  the  viscoplasticity  theory  based  on  overstress  [6] 
are  used  to  synthesize  an  orthotropic  representation  using  anisotropic 
elasticity  theory  as  a  guide.  Tensor  function  representation  theorems  are 
very  useful  and,  to  the  knowledge  of  the  authors,  have  not  been  employed  in 
the  derivation  of  anisotropic  viscoplasticity  theories. 

After  some  introductory  remarks  the  evolution  equations  for  the 
inelastic  strain  rate  and  for  the  equilibrium  stress  are  given  in  chapter  2, 

To  facilitate  identification  it  is  assumed  that  elastic  anisotropy  and 
inelastic  anisotropy  are  the  same,  i,e.  the  respective  "fourth-order  material 
tensors"  are  related  by  a  scalar  multiple.  Inelastic  incompressibility  is 
not  considered  in  this  paper  but  has  been  formulated  elsewhere  [9,10] *  In 
chapter  3  the  equations  are  specialized  for  various  simple  tests  which  are 
important  in  the  identification  procedure.  It  is  demonstrated  that  the  theory 
is  capable  of  reproducing  elastic  response  in  one  direction  while  the  other 
two  directions  exhibit  inelastic  behavior.  The  capabilities  of  the  proposed 
theory  are  illustrated  by  numerical  experiments  (numerical  integration  of 
the  coupled  nonlinear  differential  equation  system  using  hypothetical  but 
realistic  material  properties) . 


309 


2.  EVOLUTION  EQUATIONS 
2.1  Introductory  Remarks 

Anisotropy  has  been  considered  within  the  context  of  time-independent 
yield  surface  plasticity,  e.g.  [11,12]  to  cite  just  a  few,  and  for  visco¬ 
plasticity  [13-16] .  The  present  approach  differs  from  the  others  by  the  use 
of  representation  theorems  for  isotropic  tensor  functions  using  "geometric" 
tensors  and  the  usual  deformation  variables  as  arguments.  To  this  end,  the 
constitutive  relations  for  orthotropic  materials  depend  on  three  symmetric 
geometric  tensors  m,  n,  Z,  which  are  formed  as 


m .  . 

=  M, 

M. 

(1) 

1 

J 

n ,  , 

N. 

N. 

(2) 

ij 

1 

J 

SL,, 

^  L, 

L, 

(3) 

ij 

1 

J 

as  well  as  the  usual  mechanical  variables.  The  unit  vectors  N^,  and 
are  mutually  orthogonal  to  each  other  and  parallel  to  the  intersections  of 
the  three  orthogonal  symmetry  planes  which  define  the  state  of  orthotropy. 

The  infinitesimal  total  strain  rate  is  the  sum  of  elastic  and 
inelastic  parts 


e. . 

ij 


•  e  ,  'in 

.  +  e. . 

ij  iJ 


(4) 


The  elastic  strain  rate  is  governed  by  the  hypoelastic  law 

=  S..  a  .  (5) 

ij  ijpq  pq 

The  inverse  of  (5)  is 

a..  =  .  (6) 

ij  ijpq  pq 


Representations  of  S  and  Q  for  orthotropy  are  given  in  the 

Appendix. 
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Using  (4),  Eq.(6)  can  be  rewritten  as 


a. .  =  C..  (e  -  . 

ij  ijpq  pq  pq 


(7) 


2.2  Evolution  of  Inelastic  Strain 


■rn 


The  inelastic  strain  rate  is  assumed  to  depend  on  a  quantity 

called  overstress  [6-8]  and  denoted  by  X  . 

ij 

•in 


eT.  =  F.  . [X  ,  m  ,  n  ,  I  ] 
ij  ij  pq  pq  pq'  pq 


where 


(8) 


X  =  a  -  g 

pq  pq  pq 


(9) 


The  quantity  g  is  the  equilibrium  stress  for  which  an  evolution  law  is 

pq 

needed.  The  function  F.,  is  an  isotropic  tensor  valued  function  of  the 

ij 

arguments  enclosed  by  square  brackets. 

The  complete  representation  of  is  available  from  [17,18] 

•  in 

e  =  k  m  +  k  n  +  k  ^,  +  k,  (mX  +  X  m) 

--  1-  2  ~  3-  4~~ 


+  k,(nX  +  Xn)  +  k-(JlX  +  XJl) 

^ -  b - 


(10) 


where  the  scalar  functions  depend  on  seven  invariants 


tr  m  X  ,  tr  n  X  ,  tr  £  X 


2  2  2  3 

tr  mX  ,  tr  nX  ,  tr  £X  ,  tr  X 


(11) 


By  choosing 


0 


(12) 


the  quadratic  term  X  is  eliminated  in  the  present  study  in  accordance 
with  the  uniaxial  overstress  model  [6-8], 
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Retention  of  this  term  offers  the  possibility  of  modeling  second^order 
effects  within  the  context  of  finite  deformation  which  will  not  be  explored 
here. 

Thus  the  complete  representation  of  (10)  can  be  given  as 


•  in 


k.  .  X 

ijpq  pq 


(13) 


where  the  components  of  the  fourth-order  tensor  k  are  functions  of  the 
scalar  invariants  and  the  geometric  tensors  given  in  (11)  and  (l)-(3), 
respectively  * 

In  order  to  reduce  the  complexity  of  the  fourth-order  tensor  k 
further,  we  will  introduce  the  concept  of  **anisotropy  ratios.”  These 
ratios  can  also  be  defined  for  elasticity.  As  an  example,  the  ratio  of 
Young’s  moduli  in  two  different  directions  in  the  material  is  an  ’’anisotropic 
ratio.”  Thus,  (10)  becomes 

=  k  r..  X  .  (14) 

ij  ij  pq  pq 

The  components  of  the  dimensionless  quantity  r  are  called 
"inelastic  anisotropy  ratios,"  In  general  they  may  be  functions  but  will 
be  constants  in  this  paper.  The  representation  of  r  for  orthotropy  is 
given  in  (A-10) . 

The  quantity  k  has  dimensions  of  (time-stress)  ^  and  is  a  scalar 
function  of  the  set  of  invariants  given  in  (11).  This  function  is  simply 
related  to  "the  viscosity  function"  of  the  isotropic  or  uniaxial  theory 
[6-8].  It  governs  the  strain-rate  dependence  of  the  uniaxial  stress-strain 
curves  in  different  material  directions. 

For  practical  reasons  a  single  argument  denoted  by  J  is  proposed 
for  the  viscosity  function  k 

=  tr(rX)(rX)  ,  (15) 
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It  is  possible  to  use  a  different  r  in  (15)  than  the  one  given  in  (lA)  , 


2 * 3  Evolution  of  the  Equilibrium  Stress 

An  evolution  law  for  the  equilibrium  stress  can  be  developed 
from  the  representation  theorems  [17,18],  In  generalizing  the  uniaxial  over¬ 
stress  model,  the  rate  of  equilibrium  stress  is  an  isotropic  tensor  function 
of  six  symmetric  tensors 

•  -1 

(16) 


IJ  IJ  pq  pq  pq  pq  pq  pq 


where 


g  .  .  =  g  ,  .  -  f  ,  . 


(17) 


and  f  is  the  generalization  of  the  f  function  of  [6]*  Its  derivative 
represents  the  final  slope  of  the  stress-strain  relation  in  the  plastic 
range. 


The  complete  representation  of  (16)  is  too  general  and  lengthy 
for  present  purposes.  Instead,  the  foms  proposed  in  [10]  will  be  con¬ 
sidered  (see  Eqs.(18)  and  (20)  of  [10]),  A  special  case  which  retains 
sufficient  generality  for  the  present  is 


•  -1 

g  .  ,  =  ^  r  .  .  Y 

ij  ijpq  pq 


(18) 


The  scalar  is  a  decreasing  function  of  overstress  X  and  is  initially 
slightly  less  than  Young's  modulus  in  one  of  the  preferred  directions,  e.g. 


The  anisotropy  ratio  r  ^  is  described  after  (A-10) . 


(19) 


If  then  (7),  (14)  and  (18)  represent  only  linear  elastic  behavior. 

Any  positive  results  in  inelastic  behavior.  Usually  £^^  =  0.99  E^. 
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3,  UNIAXIAL  TESTS  IN  PREFERRED  DIRECTIONS  FOR  CONSTANT  ANISOTROPIC  RATIOS 
If  we  assume  that  anisotropic  ratios  remain  the  same  for  both 
elastic  and  inelastic  deformations,  then  the  general  formulation  of  the 
previous  section  simplifies  considerably.  In  this  case 
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(24) 


r  =  S 


r 


(25) 


where  the  quantities  S,  C  and  r  were  introduced  in  (5),  (6)  and 
(14).  The  explicit  forms  of  S  and  C  are  given  in  the  Appendix, 

The  stress-  and  strain-tensor  components  are  assumed  to  be  given 
in  the  preferred  coordinate  system  represented  by  M,  N  and  The  uniaxial 

tests  are  carried  out  in  the  preferred  directions  and  then  the  geometric 
tensor  components  of  (1)  -  (3)  are 


all 

m  .  -  0 

except 

mil  =  ^ 

all 

n  ,  .  0 

iJ 

except 

"22  "  ^ 

(26) 

all 

=  0 

11 

except 

2,33  =  1  . 

3,1  Uniaxial  Tension  in  the  z-Direction 

In  a  uniaxial  tension  test  in  the  preferred  z  or  3  direction  all 

the  components  of  the  stress  tensor  are  zero,  except  which  corresponds 

to  the  applied  load.  The  lateral  components  of  the  equilibrium  stress 

tensor  are  zero.  The  shear  components  vanish  due  to  material  symmetry. 

The  evolution  law  for  O  is  obtained  from  (7)  by  using  the 

z 

quantity  C  given  in  the  Appendix 


where 


* 

a 

z 


E.(e 
3  z 


•rn 

e 

z 


) 


•rn 

£ 

z 


r^(a  -  g^)k 
2  Z  2 


(27) 

(28) 


The  strain  rate  e  =  constant  is  the  input  parameter, 
z 

Since  the  formulation  corresponds  to  a  constant  Poisson's  ratio, 
the  lateral  strain  rates  are  also  constant 
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(29) 


e  =  - 


31  z 


G  ^  - 


^32  S 


(30) 


The  lateral  inelastic  strain  rates  are 


•in 

•in 


•  in 

^r  E 

31  z 


•in 

E 

32  z 


(31) 

(32) 


The  argument  V  of  the  function  k  is  determined  from  (15)  using 
(28),  (31)  and  (32) 


2  2  2  2  2 

Ti  =  (r3^  +  r32  +  Dr^Ca^  -  g^) 


(33) 


The  only  nonzero  component  of  the  equilibrium  stress  has  the 
following  evolution  law  which  is  obtained  from  (18)  using  (25) , 


I 


\  z  g  2  I 


(34) 


where  T  is  obtained  from  (21)  as 
g 

2  2  2  2  2  2 
Tg  =  (r^^  +  r^2  + 

and  4)  is  given  by  (23)  ,  Note  that  all  components  of  f  ^  are  zero  except 


(35) 


f  -  P.  e 

z  1  z 


(36) 


as  can  be  seen  from  (22) , 

For  simplicity  let  us  assume  that  f^^  is  zero,  and  set  =  0  • 
The  asymptotic  solutions  of  (27)  and  (34)  will  occur  when 


E 

z 


•in 

E 

2 


(37) 
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and 


2 

r  =  1 


(38a) 


or,  correspondingly 


f  2  ,  _l/2 

r^Crji  +  rjj  +  1) 


(38b) 


which  render  a  =  g  =0. 

z  z 

Notice  that  the  final  value  of  the  equilibrium  stress  is  a 

* 

constant  which  is  independent  of  the  input  strain  rate  , 

Equation  (37)  is  a  nonlinear  algebraic  expression  of  the  over-^ 

stress  o  -g  •  For  different  input  strain  rates,  different  asymptotic 
z  z 

overstress  values  are  obtained. 


3 . 2  Uniaxial  Tension  in  the  x-Direction 

Similar  procedure  is  applied  to  the  case  of  constant  strain-rate 
loading  in  the  x-direction.  We  will  compare  the  asymptotic  properties  of 
this  case  to  that  of  the  previous  case. 

The  evolution  of  is  governed  by 


*  ^  /•  ’in. 

a  =E.  (e  -  £  ) 

X  lx  x 


(39) 


where 


•  f  \  1 

e  =  (a  -  g  )k 

X  X  X 


(40) 


The  laterial  total  and  inelastic  strain  rates  are  governed  by 

•  1  • 

e  -  -  M  —  e 
y  21  E„  X 


(41) 


•  1  • 

■  ^31  ^x 


(42) 


•in  'in 

e  =  -r-T  r  £ 
y  21  1  X 


(43) 


• in  'in 

e  =  -r  r„  £ 

2  31  2  X 


(44) 


317 


The  argument  V  of  the  function  k  is 


(45) 


The  nonzero  component  of  the  equilibrium  stress  is  governed 


by 


g  =  il>(i  ~  r  (1  -  4>)  ; 

X  \  X  2  X  /  ■ 


(46) 


where  functions  and  ((^  are  the  same  as  the  previous  case,  and  T  is 

& 


‘’s'  (‘■^21  +  <'31 ’^2>^'"  ^  )<«K-V^<*' 


(47) 


The  nonzero  component  f^  of  f  _  is  given  by 


f  -  P,  e 

X  lx 


(48) 


The  asymptotic  solutions  of  (39)  and  (46)  ,  when  =  0  and 
({>=0,  will  occur  when 


e  =  =  (a  -g  )  k[rj 

XX  X  X  2. 


and 


(49) 


Sx  = 


(<'21 +  (1^31 


(50) 


Thus  the  final  value  of  the  equilibrium  stress  is  the  constant 
on  the  right-hand  side  of  (50)* 

The  spacing  of  the  equilibrium  stresses  and  g^  of  these  two 
uniaxial  tests  is  governed  by 


!x 

g.  *■  ’'2 


/-  2  2  _l/2 

^^31  ^32  _ 


(<'21  +  l) 


1/2 


(51) 
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which  is  obtained  from  (50)  and  (38b).  Since  the  equilibrium  stress  is 
a  measure  of  time- independent  change  in  the  material,  the  ratio  in  (51) 
can  be  called  "plastic  anisotropy.” 

The  strain-rate  spacing  of  the  uniaxial  test  curves  in  the 
x-direction  is  governed  by  the  nonlinear  algebraic  equation  (49) . 
Different  spacings  predicted  in  x-  and  z-directions  by  (49)  and  (37)  for 
the  same  input  strain  rate  can  be  termed  "viscous  anisotropy.”  The  ratio 
of  the  spacings  is  governed  by 


a 

X 


a 

z 


k[r^] 


(52) 


•  • 

where  £  =  E  and  F,  and  F^ 
z  X  1  2 


are  given  by  (33)  and  (45),  respectively. 


4.  DISCUSSION 


4,1  General 

The  fully  invariant  three-dimensional  formulation  is  a  simplified 
version  of  the  theory  presented  in  [9] ,  Here  it  is  assumed  that  the  elastic 
anisotropy  ratio  S  (see  (A, 10))  is  equal  to  r  appearing  in  (14),  (18)  and 
(22)  which  govern  the  evolution  of  the  inelastic  strain  rate.  This  choice 
simplifies  the  identification  of  the  material  functions  to  be  discussed 
shortly.  As  a  consequence  of  this  choice,  constant  Poisson's  ratios  are 
found  in  the  uniaxial  tests;  see  (29)-(32)  and  (41)-(44),  This  restriction, 
which  is  judged  to  be  acceptable  for  the  present  purpose,  is  removed  in  [9] 
where  a  formulation  with  variable  Poisson’s  ratio  is  given. 


4 , 2  Ident if icat ion 

Rate  dependence  of  the  uniaxial  stress-strain  curves  in  a  particular 

7* 

direction  in  the  material  are  governed  by  the  function  k  in  a  nonlinear  manner. 


The  function  k  in  this  paper  is  equivalent  to  (1/Ek)  in  previous  papers. 
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see  (37)  or  (49),  and  the  invariant  F.  They  control  viscous  anisotropy. 

In  principle  the  identification  procedure  is  as  follows.  Uniaxial 
tests  in  a  certain  preferred  direction,  say  x,  are  performed  at  piecewise 
constant  strain  rates  during  one  test.  Then  a  k  function  is  introduced  which 
can  reproduce  the  same  strain-rate  spacing  as  found  in  the  experiment.  Thus 
the  relaxation  and  creep  properties  in  the  x-direction  are  also  determined, 
see  [6] . 

The  strain-rate  spacing  (and  correspondingly  the  creep  and  the 

relaxation  properties)  in  some  other  direction,  say  z,  can  be  adjusted  through 

the  quantity  r  which  is  used  to  calculate  F  in  (15) .  It  is  not  necessarily 

the  same  as  that  of  (14),  (18)  and  (22),  although  this  is  assumed  in  the  body 

of  the  paper.  (An  exception  is  made  in  the  numerical  examples  to  show  the 

versatility  of  the  approach.)  An  analytical  expression  is  given  in  (52)  for 

the  ratio  of  the  uniaxial  overstress  values  in  the  x-  and  z-directions .  This 

ratio  depends  on  r^  of  r  in  (14),  and  the  function  k  evaluated  at  two  different 

arguments  F^  and  F^*  which  are  given  by  (33)  and  (45)  for  x-  and  z-directions, 

respectively.  In  such  a  way,  viscous  anisotropy  is  reproduced. 

The  plastic  (rate- independent)  anisotropy  is  governed  by  the 

asymptotic  spacing  of  the  uniaxial  equilibrium  stress  curves  and  controlled 

by  the  invariant  F  .  As  in  the  case  of  F,  quantity  r,  which  is  used  to  cal- 

culate  F  in  (21),  does  not  have  to  be  the  same  as  that  of  (14)  or  (15).  The 
g 

asymptotic  g  values  for  uniaxial  tests  in  x-  and  z-directions  and  their  ratio 
are  given  by  (35b),  (50)  and  (51)  in  terms  of  the  particular  r  in  (21),  They 
can  be  used  for  the  adjustment  of  material  data. 

4 , 3  Illustration  of  the  Theory  by_  Nutnerical  Experiments 

It  was  mentioned  that  the  proposed  simplified  approach  offers 
flexibilities  by  choosing  the  anisotropy  ratios  in  the  invariants  F  and  F^ 
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in  different  ways*  Specifically  three  cases  can  be  identified. 

(i)  The  quantities  r  in  (14),  (15)  and  (21)  are  all  equal  to  each 
other  which  are,  in  turn,  set  equal  to  S  . 

(ii)  The  quantity  r  for  the  inelastic  strain  rate  in  (14)  is  set 

equal  to  S.  The  quantities  in  (15)  and  (21)  are  equal  but 
different  than  S, 

(iii)  The  quantity  r  in  (14)  is  set  equal  to  E^  S.  The  other  quantities 
in  (15)  and  (21)  are  chosen  independently. 

In  case  (i)  the  inelastic  properties,  including  viscous  and  plastic 
anisotropy,  are  nonlinearly  governed  by  the  elastic  properties. 

This  behavior  is  illustrated  by  numerical  experiments  (the  numerical 
integration  of  the  coupled  system  of  nonlinear  differential  equations)  using 
the  material  data  listed  in  Table  1.  It  should  be  noted  that  the  material 
constants  were  chosen  so  that  transverse  isotropy,  a  special  case  of  orthotropy 
is  modeled.  The  preferred  direction  is  the  z-  (or  3«)  direction.  As  a  conse¬ 
quence  the  behaviors  in  the  x-  and  y-directions  are  identical. 

Figure  1  shows  the  results  of  a  tensile  test  with  a  strain  rate  of 
«  —4  —1 

e  =  10  s  .  Both  the  components  of  0  and  g  in  the  z-  and  x-directions  are 

plotted.  It  is  seen  that  a  > a  and  g  >g  at  all  times  corresponding  to 

X  Z  X  z 

(E  =  E  )  >  (E  =E  ).  At  the  end  of  the  graph  the  asymptotic  values  are  approxi-^ 
X  r  z  ^ 

mately  reached  and  correspond  to  the  theoretical  predictions,  (It  should  be 
noted  that  in  Fig,  1  and  in  subsequent  figures  is  chosen  differently  from 
zero  resulting  in  a  positive  final  slope  of  the  a  and  g  curves,) 

The  relaxation  behavior  and  the  hysteresis  loops  predicted  by  the 
theory  are  plotted  in  Figs,  2  and  3,  respectively. 

The  different  amounts  of  stress  reduction  during  the  240  s  relaxation 
time  (AB  vs,  A’B*  in  Fig.  2)  are  obvious.  In  the  cyclic  case  a  considerable 
amount  of  Bauschinger  effect  is  evident.  It  is  also  of  interest  that  the 
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hysteresis  loops  are  not  quite  closed  indicative  that  the  asymptotic  values 
are  not  yet  reached  at  1.2%,  (The  theory  presented  herein  represents  cyclic 
neutral  behavior,  see  [6,7].)  In  regions  where  the  a-  and  g-curves  coincide 
elastic  behavior  is  represented.  It  can  be  observed  that  the  elastic  regions 
in  the  two  directions  are  different. 

In  cases  (ii)  and  (iii)  the  viscous  and  the  plastic  anisotropy  become 
uncoupled  from  the  elastic  properties.  It  is  now  possible  to  adjust  the  ratio 
of  the  uniaxial  overstress  values  and/or  the  asymptotic  spacing  of  the  uniaxial 
g-curves  independently . 

However,  it  should  be  noted  that  the  ratios  of  the  lateral  strains 
to  the  uniaxial  strains  remain  fixed  and  are  governed  by  the  elastic  Poisson’s 
ratios , 

A  numerical  example  is  given  for  case  (ii)  in  Fig.  4.  The  material 
data  is  the  same  as  that  of  Figs,  1,2  and  3,  except  r^  in  (15)  and  (21)  is  set 

equal  to  zero.  Thus  the  invariants  T  and  T  become  zero,  see  (33)  and  (35). 

8 

As  a  consequence  k  in  (28)  is  constant  and  equal  to  k[0]  which  is 
typically  a  very  small  number.  This  choice  renders  the  inelastic  strain  rate 
very  small.  Similarly  the  evolution  of  is  a  straight  line  with  modulus 
t|/[0]/r2-  ^^i®  choice  gives  nearly  elastic  behavior  in  the  z-dir action  without, 

of  course,  affecting  the  response  in  the  x-direction.  These  properties  are 
illustrated  in  Fig.  4 

Another  interesting  example  of  case  (ii)  is  the  modeling  of  nearly 
elastic  response  under  a  hydrostatic  state  of  stress 


(53) 

(54) 

(55) 
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in  (15)  and  (21)j  the  invariants  T  and  will  filter  out  all  the  hydrostatic 
components.  However  the  trace  of  the  inelastic  strain  rate  in  (14)  is  not 
zero. 

It  can  be  shown  that,  if  a  hydrostatic  state  of  stress  is  applied, 
the  equilibrium  stress  and  consequently  the  overstress  are  also  hydrostatic. 
The  choice  of  (53)  -  (55)  in  (15)  and  (21)  yields  F  =  F  =  0  in  this  case.  Thus 

O 

the  behavior  is  again  nearly  elastic. 

An  example  of  case  (iii)  is  given  in  Fig*  5.  The  material  data  is 
the  same  as  that  of  Figs.  1,2  and  3,  except  in  is  s€;t  equal  to  0,5  (note 
that  “  50/35  otherwise) .  The  behavior  in  the  x-direction  is  not 

affected  by  this  choice;  compare  Fig.  5  to  Fig,  1,  The  curves  for  uniaxial  a 

z 

and  coincide  with  those  of  Fig.  1  in  the  elastic  region.  Although  the 
Young’s  modulus  in  the  z-direction  is  smaller  than  E^,  the  asymptotic  value 
of  the  g  -curve  is  higher  than  that  of  g  due  to  the  choice  of  r^  =  0,5  in  F  ; 

2  g* 

see  also  (51) .  (thie  to  the  choice  of  functions,  the  evolution  for  g  has  not 

z 

reached  the  asymptotic  value  at  the  strain  limit  of  the  graph.)  The  amount 

of  overstress  in  the  z-direction  is  not  affected  by  F  ;  compare  Fig,  1  and 

8 

Fig.  5. 

4 .4  Additional  Remarks 

The  inelastic  deformation  is  not  volume  preserving  in  the  present 
formulation.  However,  a  model  with  isochoric  inelastic  deformations  is 
developed  in  [9]  and  will  be  presented  elsewhere.  The  theory  is  intended  for 
use  with  composites  as  as  well  as  ’’naturally”  anisotropic  metals.  Since 
experimental  confirmation  of  the  isochoric  nature  of  inelastic  deformation 
of  these  materials  seems  to  be  absent,  both  formulations  were  developed  in 
[9]. 


323 


When  initially  loaded,  the  response  of  the  present  theory  is 
symmetric  with  respect  to  the  stress-strain  origin;  i.e.,  the  stress-strain 
diagrams  in  tension  and  compression  are  congruent.  When  other  than  purely 
quadratic  invariants  are  retained  as  arguments  in  the  functions,  e.g.  in  ijj, 
"strength  differential  effects"  can  be  reproduced.  In  this  case  the  stress- 
strain  diagrams  in  tension  and  compression  are  not  congruent.  Examples  of 
models  of  such  behavior  are  given  in  [9]  . 
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Table  1 


MATERIAL  CONSTANTS  AND  FUNCTIONS  USED 
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STRESS  MPa 


Figure  1*  Uniaxial  loading  in  x-  and  z-directions  for  e  = 
( - )  stress;  ( - )  equilibrium  stress 


STRESS  MPd 


Figure  2,  Uniaxial  loading  in  x-  and  z-directions 
with  4  min.  of  relaxation  periods  at  e 


STRESS  MPa 
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Figure  4 . 


•  -4  -1 

Uniaxial  loading  in  x-  and  z-directions  at  e  =  10  sec 
Nearly  elastic  behavior  in  the  z-direction,  r_ =  0  in  T  and  F  . 

2  g 
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STRESS  MPa 


•  -4 

Figure  5.  Uniaxial  loading  in  the  x-  and  z-directions  at  e  -  10  sec 

Asymptotic  level  of  the  O  curve  adjusted  by  choosing  r 

z  ^ 

instead  of  E, /E_,  in  T  • 


CM 


A-1 


APPENDIX 


The  elastic  coefficient  tensor  introduced  in  (5)  can  be 


defined  implicitly  by  the  following 


where 
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The  representation  given  by  (A.I)  and  (A. 2)  is  taken  from 
Boehler  [18],  The  elastic  constants  S^,  i = 1-9  are  related  to  the 
so-called  engineering  constants  E^,  E^,  (Young's  moduli), 
(Poisson’s  ratios),  ^23’  *^12’  ^13  moduli).  The  relations 


given  as 


are 


S,  = 


S.  = 


S,  = 


(A. 3a) 
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in  matrix  form 
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^7  “ 
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Equation  (A.l)  can  also  be  put 
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where  the  S . ,  are  given  by 
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A-3 


The  quantity  6*.  is  related  to  the  Kronecker  6  .  and  assumes  different  values 

ij 


for  different  components  of  S 


ijkil’ 


6*.  =  “  6.,  for 
2  ij 


^1111*  ^2222’  ®3333’ 
^1122,  ^1133’  ^2233  * 
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,  for  , 

1 

S3323, 

^3313’ 

^23Z3’ 

^2313  ’ 

^2312’ 

^1313’ 

^1312’ 

^1212 

(A,  5b) 


for  S^^23»  ^2213’  ^3312  above  definitions  can  be  employed. 

The  expression  on  the  right-hand  side  of  (A. 5)  is  not  a  tensorial 
representation*.  It  should  be  viewed  as  a  formula  for  computing  the  components 
of  the  fourth-order  tensor  S  for  a  given  orientation  of  the  preferred  directions 
K . ,  N  and  L . . 


A  representation  of  the  fourth— order  tensor  of  orthotropy  based  on  the 
geometry  tensors  m,  ^  and  ^  is  given  in  [9] .  It  is  extremely  difficult 
to  relate  the  nine  constants  of  this  representation  to  those  of  (A. 5). 
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The  inverse  of  (A.l),  (or  equation  (7))  is 


where 
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The  elastic  constants  i-1,9  are  related  to  the  engineering 


constants  through 
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and 


A-:) 


2  2  2  ^2 

D  -  1  -  V21  -  ^31  -  '^32  ^  ^21  ^31  ^32 


(A.  9) 


Equation  (A. 6)  can  also  be  written  in  the  matrix  form  similar  to 
(A. 4).  The  components  of  C..  can  be  obtained  from  (A. 5)  by  simply 

ijpq 

replacing  by  given  in  (A. 8),  i“l,9. 

It  should  be  noted  that  there  are  some  restrictions  in  the 
engineering  constants  ,  etc.,  and  they  are  given  by  Jones  [19]. 

The  anisotropy  ratios  r  ,  used  in  the  text  can  be  obtained 
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from  (E_  S..  )  by  replacing 
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The  inverse  r  is  obtained  from  C/E^  via  the  same  procedure. 
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ABSTRACT.  The  stability  of  plane  Couette  flow  of  an  upper  convected 
Maxwell  fluid  is  investigated  using  a  numerical  method.  No  evidence  of 
instabilities  has  been  found.  The  essential  features  of  the  linearized 
spectrum,  as  well  as  those  of  the  numerical  approximation,  which  can  lead  to 
artificial  instabilities,  are  discussed. 

INTRODUCTION .  While  there  is  a  fairly  comprehensive  picture  of  the 
stability  of  parallel  shear  flows  of  Newtonian  fluids,  little  is  known  about 
viscoelastic  fluids.  Experimentally,  instabilities  of  flows  of  polymers  are 
observed  in  a  variety  of  geometries,  even  though  the  Reynolds  number  (a 
measure  of  the  effect  of  inertia)  is  small.  These  instabilities  are  caused  by 
the  elasticity  of  the  fluid,  which  is  measured  by  another  dimensionless 
parameter,  called  the  Weissenberg  or  Deborah  number.  There  seems  to  be 
considerable  controversy  over  the  precise  origin  of  these  instabilities,  e.g. 
whether  they  are  a  feature  of  parallel  sheekr  flows  or  originate  in  inflow  or 
outflow  regions;  for  a  review  of  experiments  and  various  attempts  at 
theoretical  explanations  see  [2] . 

Denn  and  his  coworkers  [1],  [31,  [5],  have  investigated  the  linear 
stability  of  plane  Poiseuille  flow  of  an  upper  convected  Maxwell  fluid.  In 
the  Newtonian  case,  this  flow  becomes  unstable  at  a  critical  Reynolds  number 

•  5772  .  Porteous  and  Denn  [3]  found  that  a  small  amount  of  elasticity 

decreases  the  critical  Reynolds  number  to  values  of  1000-2000  when  the 
Weissenberg  number  is  increased  to  1.  At  higher  values  of  the  Weissenberg 
number,  they  experienced  numerical  difficulties.  These  results  prompted  the 
conjecture  that  further  increase  of  the  Weissenberg  number  may  continue  to 
decrease  the  critical  Reynolds  number,  and  that  there  might  ultimately  be 
instabilities  even  at  very  low  Reynolds  numbers.  While  Rothenberger,  McCoy, 
and  Denn  [5]  claimed  to  have  found  such  instabilities,  a  later  study  by  Ho  and 
Denn  [1]  indicated  that  these  were  an  artifact  of  the  discretization,  and  no 
evidence  of  real  instabilities  at  low  Reynolds  number  was  found. 

In  this  paper,  we  look  at  linear  stability  of  plane  Couette  flow  for  an 
upper  convected  Maxwell  fluid.  In  contrast  to  Denn  and  his  coworkers,  who 
used  a  shooting  method  to  compute  individual  eigenvalues,  we  use  a  spectral 
method  and  a  matrix  eigenvalue  solver  to  compute  the  whole  spectrum  of  the 
problem.  In  this  way  we  can  obtain  a  more  ccwaprehensive  picture  not  only  of 
the  spectrum  of  the  continuous  problem,  but  also  of  the  behavior  of  the 
numerical  approximation  and  of  the  origin  and  nature  of  artificial  instabilities. 


Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-80K:-0041. 
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II*  FORMULATION  OF  THE  PROBLEM*  We  consider  the  flow  between  two 
parallel  plates  moving  in  opposite  directions  with  equal  velocities*  The 
direction  of  the  motion  of  the  plates  is  the  x-'direction  and  the  direction 
perpendicular  to  the  plates  in  the  y-direction.  We  non-dimensionalize  the 
problem  in  such  a  way  that  the  plates  are  at  y  =  1  and  y  =  -1  and  move  in  the 
x-direction  with  velocities  +1  and  -1,  respectively-  The  equations  of  the 
problem  are  the  equation  of  motion 

3^ 

+  (u  •  V)jLi]  =  -Vp  +  divT 

(1) 

div  u  -  0  ,  * 

and  the  constitutive  relation  for  an  upper  convected  Maxwell  fluid 
T  +  w[|“  +  (5  •  V)5  -  (7u)£  -  ^(Vu)*^]  =  Vu  +  (Vu)"^  , 


Here  R  is  the  Reynolds  number  cmd  W  the  Weissenberg  number.  Plane  Couette 
flow  is  the  trivial  solution 


u=  (y,0)  ,  T=  (2^  1)  . 


We  linearize  at  this  solution.  Since  Squire's  theorem  holds  for  the  upper 
convected  Maxwell  fluid  [6] ,  only  two-dimensional  disturbances  need  to  be 
considered.  We  separate  variables  and  look  at  disturbances  proportional  to 

e^®^e*^^  ,  and  we  express  the  perturbed  velocities  in  terms  of  a  stream 
function.  Ihe  resulting  eigenvalue  problem  is  [3] 

b,  (y)<|.'"  +  b^  (y)^"  +b-  (y)^  +  b.  (y)^ 


where 
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2  s*^(S  -  1)^ 

b  (y)  =  -2aS  2^-1-- - i-  , 


2  2  4 

.  .  .  4  „  2  S'  (S  +  1)  ^  ^  S' 

b^(y)  =  a  -  2a  - - - 
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The  boundary  conditions  are 


♦  “  *  0  at  y  =  ±1  • 

2 

For  the  numerical  treatment,  we  multiply  (4)  by  S  and  expand  (j)  in  a  series  of 
Chebyshev  polynomials.  This  leads  to  a  matr-^x  eigenvalue  problem  of  the  form 
det  (A  -  QB)  -  0  ,  which  was  solved  by  the  NAG  routine  F02GJF.  In  our 
COTiputations ,  we  used  between  24  and  60  Chebyshev  polynomials, 

III-  Discussion  of  results-  Before  discussing  our  numerical  results,  we 
outline  some  features  which  can  be  obtained  analytically.  For  S  =  0,  the 
equation  has  a  singular  point.  This  leads  to  a  continuous  spectrum  given  by 

S=0-CT=-^  +  ioy  ,  y  €  [-1, 1]  .  (8) 

If  S  0,  the  equation  is  regular,  and  eigenvalues  are  isolated.  An  exact 
solution  to  the  problem  can  be  obtained  in  the  limiting  case  a  *  0  .  If  we 

(4)  * 

assume  S  0,  we  obtain  =  aR(1  +  =  :a  if*'  .  (9) 

* 

The  eigenvalues  a  are  real  and  negative  and  tend  to  •  We  have 


a 


/  w  * 

-1  ±  +  4  a 

_ R  f 

2W 


(10) 


and  we  can  conclude  from  this  that,  except  for  a  finite  number,  the 
eigenvalues  have  real  part  -  ,  Even  for  a  0  ,  a  formal  asymptotic 

argument  assuming  0  large  and  very  oscillatory  eigenfunctions  leads  to  the 
conclusion  that  the  real  parts  of  the  eigenvalues  should  tend  to 

4#W 

w  * 

asymptotically*  The  Newtonian  eigenvalues  are  recovered  from  (10)  if  —  cr  is 

R 

small.  That  is,  the  spectrum  looks  more  Newtonian  if  W  is  small  and  R  is 
large. 


The  numerically  computed  eigenvalues  reflect  these  overall  features.  No 
evidence  of  instability  was  found  in  the  parameter  range  where  we  could  obtain 
converged  results.  For  W  in  excess  of  about  5,  round-off  errors  caused 
substantial  problems,  which  we  hope  to  overcome  in  the  future  by  using  higher 
precision  arithmetic.  For  the  Newtonian  case,  it  is  known  that  plane  Couette 
flow  is  always  stable  [4] . 

The  presence  of  a  continuous  spectrum  suggests  the  possibility  of 
eigenvalues  which  either  bifurcate  from  it  or  approach  it  in  certain  limits. 
Such  eigenvalues  were  indeed  found  in  otir  calculations.  In  the  following 
table  we  have  R=1,  a=1,  and  we  show  one  of  the  eigenvalues  as  a  function 
of  W. 


w 

Rea 

Imo 

2.0 

-0.370 

±0.774 

1.7 

-0.460 

±0.748 

1.5 

-0.543 

±0.730 

1.2 

-0.722 

±0.703 

1.0 

-0.900 

±0.687 

0.5 

-1.94 

±0.655 

0.2 

-4.97 

±0.646 

Clearly,  the  real  peurts  approach  -  ^  as  W  0  . 

It  Is  Interesting  to  look  not  only  at  converged  results,  but  also  at 
the  behavior  of  the  numerical  approximations.  It  is  well  known  that  the 
scheme  used  here  leads  to  "spurious"  inodes,  i.e.  to  eigenvalues  of  the 
discrete  problem  which  are  pure  artifacts  of  the  discretization  and  do  not 
approximate  those  of  the  differential  equation  even  in  a  qualitative  sense. 

In  our  calculations,  we  find  four  such  spurious  eigenvalues;  they  can  be 
stable  or  unstable.  The  following  discussion  is  concerned  with  the  remaining 
eigenvalues,  which  approximate  either  the  discrete  or  the  continuous  spectrum. 

Since  the  scheme  we  use  is  infinite  order  accurate,  a  number  of 
eigenvalues  in  the  discrete  spectrum  are  approximated  very  well  provided  we 
use  a  sufficient  number  of  Chebyshev  modes.  As  the  imaginary  parts  of  the 

eigenvalues  increase,  their  real  parts  first  approach  -  ,  reflecting  the 

behavior  of  the  continuous  problem.  However,  for  eigenvalues  with  large 
imaginary  parts,  the  accuracy  of  the  numerical  approximation  deteriorates,  and 

the  real  parts  of  the  computed  eigenvalues  move  away  from  -  ^  .  If  W  is 

sufficiently  large,  they  may  become  positive,  thus  leading  to  artificial 
instabilities.  It  is  not  clear  that  using  more  Chebyshev  polynomials  will 
remove  those  artificial  instabilities:  For  any  fixed  range  of  Imo  ,  the 
approximation  is  improved,  but  at  the  same  time,  eigenvalues  with  higher  and 
higher  Imaginary  parts  are  introduced,  and  the  instadiility  may  just  be  shifted 
to  those  higher  modes. 

1  1 

The  continuous  spectrum  extends  from  -  —  -  ia  to  -  —  +  io  .  Since  the 

infinite  order  accuracy  of  the  method  applies  only  to  Isolated  eigenvalues 

with  c"  eigenfunctions  and  not  to.  the  continuous  spectrum,  the  approximation 
is  relatively  poor.  It  is  better  near  the  ends  and  worse  towards  the 
middle.  For  high  values  of  W  and/or  a  ,  this  leads  again  to  artificial 
instabilities.  Since  these  artificial  instabilities  occur  near  Imc  «  0  ,  they 
are  particularly  damaging  if  one  tries  to  use  this  kind  of  scheme  for 
transient  calculations:  Unstable  eigenvalues  with  large  imaginary  parts,  such 
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as  those  resulting  from  the  numerical  approximation  of  the  discrete  spectrum, 
can  be  stabilized  by  using  appropriate  time  discretizations,  but  this  is  not 
possible  for  eigenvalues  close  to  zero,  we  suspect  that  the  artificial 
instabilities  found  by  Denn  and  his  coworkers  [1]/  15]  also  result  from  poor 
approximations  to  the  continuous  spectrum. 

As  expected,  the  quality  of  the  numerical  approximation  deteriorates  with 
increasing  w.  Also,  at  the  same  time,  the  potential  for  artificial 
instabilities  is  increased  due  to  fact  that  the  real  parts  of  the  eigenvalues 

are  proportional  to  -  ^  ,  i.e.  there  is  less  to  ** spare"  if  you  want  to  remain 

Stable.  Higher  Reynolds  numbers  tend  to  improve  the  features  of  the  numerics. 

Calculations  of  steady  flows  of  viscoelastic  fluids  have  so  far  had 
remarkably  little  success*  There  have  been  suggestions  that  flows  could  be 
computed  by  integrating  the  equations  forward  in  "time  and  waiting  for  the 
calculations  to  settle  down  to  a  steady  state.  The  features  found  in  this 
study  suggest  that  great  caution  is  advisable  with  this  approach.  We  found 
artificial  instabilities  resulting  not  only  from  spurious  modes,  but  also  from 
poorly  approximated  ones,  and  similar  problems  were  encountered  by  Denn  with  a 
different  numerical  scheme.  It  remains  an  open  question  as  to  whether  schemes 
can  be  constructed  to  avoid  such  problems. 
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EFFECT  OF  A  WALL  ON  THE  LIFT  FORCE 
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Introduction 


Particle-fluid  flaws  occur  in  filtration  processes  and  wear 
and  erosion.  Particle  motion  near  a  wall  is  a  complex 
question.  Whether  a  particle  actually  touches  a  wall  is 
complicated  by  short  range  forces  and  in  homogeneities  in 
geometry  and  materials.  A  particle  must  first  move  through  the 
fluid  before  short  range  forces  dominate.  For  this  reason,  we 
study  the  farces  on  a  particle  moving  in  a  shearing  fluid  near  a 
wal  1 . 

In  this  paper,  we  derive  the  forces  on  a  spherical  particle 
in  Poiseuille  flow  in  a  channel,  accounting  for  the  inertial 
force  on  the  particle  due  to  the  presence  of  the  wall.  We  use 
the  method  of  Fourier  transform,  first  used  by  Childress  (1964) 
for  this  type  of  inertial  force  calculation.  It  was  applied  to 
shear  flows  by'  Saffman  (1964).  The  method  was  generalized  to 
general  shapes  by  Harper  and  Chang  (1968),  and  to  straining  and 
rotating  flows  by  Drew  (1973). 

Equation  of  Motion 

The  equations  of  motion  for  a  sphere  moving  near  a  wall  in 
an  incompressible  viscous  fluid  of  density  p  and  viscosity  u  are 

▼  .v  =  0  (1) 

'■['41  +  V  •  V  v]  =  -  »  p  +  u’=v  (2) 

in  net),  which  is  defined  to  be 

ll(t)  =  ^  ®  and  jr  —  rp(t)  .  (3) 

Here  v  is  the  velocity  of  the  fluid,  p  is  the  pressure, 
r  =  ^2’  ""a)  spatial  position,  t  is  time,  Tp  is  the 

center  of  the  sphere,  and  a  is  the  radius  of  the  sphere.  We 
assume  rpjCt)  =  d(t)  i  a,  so  that  the  sphere  does  not  intersect 
the  wall. 


This  work  was  supported  by  U.S.  Army  Research  Office  Contract 
DAAG29-82-K-0 1 85 
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The  hoLindary  conditions  are 
V  =  O 

at  Tg  "  '0,  and 


V 


r 

“dt  ^  I-  “ 


(4) 


(5) 


on  [r  “  Tp^’t)  j  —  a-  Here  ^^t)  is  the  angular  v^elocitv^  o^  the 
sphere.  The  equations  of  oiDtion  for  the  sphere  are 


d®r 


m 


dt 

dft 


=  F  +  //  2 


j-pi  +  p  [t  V  +  r 


dt 


=  I  +  J/  (i: 


rpj;<  n 


r  I 


V  +  [^  v  j  *^1 


(7) 


where  m  is  the  mass  of  the  sphere,  I  is  its  mciT^ent  o-f  inertia, 
F  is  the  eKternally  applied  force,  and  T  is  the  externally 

appl i ed  torque- 

Assume  the  sphere  is  small,  and  the  flow  far  from  the 
sphere  is  quadratic.  Specifically,  the  flow  in  the  absence 
of  the  sphere  is  given  by 


dr  j. 


(r  )  ^ 


PQ 


^  p 


D 


(S) 


Here  Z  a,  and  ^  0,  With  no  loss  of  generality  we  set 
pQ  =  O,  The  velocity  of  the  fluid  relative  to  the  sphere  xs 


u  = 


dt 


( 9 ) 


lAle  shall  call  eq,  (8)  the  outer  solution.  This  flow  is  steady, 
Nate  that  this  solution  does  not  satisfy  the  boundary  canditions 
at  the  particle  surface. 


Inner  Flow 

Let  us  consider  the  inner  problem.  The  ve?locity  is  scaled  by 
^  “  !Yo  ^Tp^  “  ^Cp  U  length  scale  is  a.  We  define 


K  =  a/d(0),  and  Re  -  pUa  /n.  We  shall  refer  to  Re  as  the 
Reynolds  number,  and  assume  that  k  and  Re  1* 

Note  that  the  flow  is  unsteady  due  to  the  relative  motion  of 
the  wall  and  the  changing  motion  of  the  fluid  far  from  the 
sphere,  as  viewed  from  the  sphere-  The  motion  of  the 
coordinate  system  is  assumed  slow,  bo  that  we  can  neglect  the 
non-inertial  accelerations  (Coriolis  farce,  centripetal  force) „ 

The  imposition  of  the  wail  boundary  condition  depends  on 
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the  size  of  k .  It  k  =  0<1),  then  the  wall  intlaences  the 
flow  (and  hence  the  motion  ot  the  sphere)  strongly.  For 
smaller  e,  we  must  ascertain  whether  wall  ettects  or  other 
perturbation  effects  are  more  important. 

For  small  e,  the  solution  the  inner  problem  is  given 
by  Brenner  (1964).  The  details  are  omitted,  but  the  salient 
features  will  be  summarized  here.  The  force  and  torque  on 
the  sphere  are  given  by  Faxen's  laws. 

The  force  is 

Fi  =  6ir4a  <yQ(rp)  -  drp  /dt)  -  y  wa*  ^Po  (10) 

If  we  assume  no  net  torque  on  the  sphere,  we  have 

ft  =  |-^XVo.  (11) 

The  inner  flow  is  equal  to  the  outer  flow,  plus  a  Stokeslet, 
plus  terms  which  decay  faster  than  jrj  |  ^  as  |ri  |  -» 


Inertial  Region 

Let  us  consider  the  flow  on  a  scale  between  the  inner  and 
outer.  Harper  and  Chang  (196S)  give  a  careful  explanation  of 
the  balances  which  must  occur.  In  this  region  the  appropriate 
Reynolds  number  is  one  based  on  the  velocity  gradient. 


Re=  — 


I»Vq  Ja‘ 


Roughly,  on  any  length  scale  between  a  and  a  Re^  the  flow 

consists  of  the  outer  flow,  plus  a  Stokeslet,  to  lowest  order. 


For  length  scales  between  a  Re^  and  JE ,  the  flow  is  the 

outer  flow,  to  lowest  order.  A  critical  balance  occurs 

on  a  length  scale  equal  to  a  Reg*-*^*.  At  this  scale,  inertia 

1/2 

contributes  a  perturbation  in  the  flow  of  order  Reg  ,  which 

by  virtue  of  Faxen's  laws,  give  an  O^Reg'*^*]  contribution  to 
to  the  force. 


The  approximate  equations  valid  in  the  inertial  subregion 


are 


11 

>1 

■ 

> 

-  0 

pUs 

- 

3  y 

V  =  0  on 

+  VjB 


.)■- 


'p  Jl^*V 


(12) 


(13) 


(14) 


Where  Ug  =  ^  (d  -  ^)  ,  and  for  convenience,  we  have  translated 

the  origin  to  the  sphere  center.  The  solution  must  match  to  the 
Stokeslet  from  the  inner  solution. 

We  can  also  see  the  relation  between  the  distance  from 
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boandary  and  the  inertial  -force.  I-F  d  a.  ll  d^  the 

inner  -flow  the  boundary.  The  inner  problem  rnay  be 

solved  by  perturbation  methods  (Cox  and  Hsu  1977)  -  I-f  a 

i  /  2 

d  Re^  ,  the  inertial  correction  due  to  this  inertial  -flow  will 

occur  at  0(Re^)^^^  and  will  dominate  the  O ( k )  corrections.  In 
this  case.,  the  analysis  o-f  Saffman  (1965,  1968),  generalized  by 

Harper  and  Chang  (196S)  gives  the  correction  to  the 

-force.  Uihen  a  =  O  Id  the  -force  can  be  computed  from 

(12-14).  To  compute  the  force  on  the  sphere,  we  must  compute 
the  induced  velocity  at  the  origin.  Then 

F  "  6?rnav^(0).  (15) 

Harper  and  Chang  (1968),  following  Saffman  (1965;  1960), 

found  the  solution  to  (12-13)  in  the  infinite  domain-  Their 
solution  also  matches  to  the  Stokeslet  at  r  =  O-  If  we  denote 
the  solution  in  the  Infinite  domain  as  y^,  p^,  and  write 


V  :=  Vq  +  Vi  , 

P  ^  Po  Pi  . 


'  1/2 

r  -  r  /a  Re^ 

d  ^  d  /a  Re^^^ ^  , 
we  have 


3 


-  ^Pi  +  U^^Vi  ^ 


(16a) 

(16b) 

(16c) 
( 16d) 

(17) 

(18) 


(19) 


where  we  drop  the  prime  on  r  and  d- 

If  we  apply  a  Fourier  transform  in  ri  and  r2i  we  have 
A 


Pi  =  11  irii^,r3)e^!i-!:dk  , 


(20a) 


A  A 

where  r  =  (rj,  Tj,  U),  k  =  (k^,  k 


(20b) 

,  ,  O) .  If  Fourier  transforms 


are  taken  of  (17,18),  and  the  pressure  transform  is 
eliminated,  a  differential  equation  can  be  derived  for  the  e, 
component  of  Pi*  It  is  this  velocity  which  gives  rise  to 
transverse,  or  lift  forces.  The  equation  is 


ar. 


J.3 

4 


(2k2 


A 

i  k  ir  3  ) 


(1<^  ikj^rjicZ)  _  Q 

(21) 
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If  we  let  G±i  “  ,  we  have 

?^13  _  (^  2  +  -i  ^.-  ^  ==  ,  i 

_  2  •  ^-  ‘  i'  J  ••  --  i  3 

Eq.  (22)  is  an  Airy  equation.  The  solution  for  P^^  is 
Tis  =  ae-^f“3  +  b^Crj) 


(23) 


where 


f(r;)  =  [  2  cosh  l^fr^  -  s) 


] 


Cs)  ds 


(24) 


where  G^^Cs)  is  the  solution  of  (22>  which  decays  as  s 
This  solution  is  found  by  Generalized  Laplace  transforms- 
The  initial  conditions  for  P13  are  determined  from  eq-  (16-a) 
in  conjunction  with  Harper  and  Chang's  solution-  We  have 


A 

^15 


d ) 


-  r  p 


j 

.(U 


0  1  =  3  ^  ^ i 


.•>  \ 


fK,  -  d)  =  -ik  ■  r^C^,  -d) 


—  1 


lT> 

J<Ur 


0  li  ^  ^2^02  i 


)  e  i  ^  d  k  : 


(26) 


where  Poii?  ^0211  ^  u  5 components  of  the  Fourier 
transform  of  v^.  They  are  given  by  Harper  and  Chang  (19<SEi). 

The  force  in  the  direction  calculated  by  Saffman  and 
Harper  and  Chang  1 s 

Fq  =  (t,-46  <27) 

The  0  correction  to  the  force  in  the  e^  direction 

equal  to 

Fi  =  65ruaVi3(0)  =  JJ  Pig  (k,  O)  dk  ,  <28) 


The  integrals  involved  in  finding  P^j  and  subsequently  F^ 
have  been  approx i mated  number ically  using  NAG  library  routine 
with  truncation.  The  caiCLaations  were  expensive  and  required 
large  amounts  of  computer  time  and  memory.  We  speculate 
whether  a  more  direct  approach  might  have  been  better.  The 
result  for  F  -  F^uavj|^3(0)  is  shown  in  Figure  1  as  a  function 
of  d.  The  total  lift  force  is  obtained  by  adding  (27)  and 

1/2 

appropriate  value  -from  Figure  1,  and  multiplying  by  (Re^) 

Conclusions 

The  lift  force  due  to  the  slip-shear  interaction  is 
enhanced  by  the  presence  of  a  wall.  The  contribution  due  to 
the  wall,  shown  in  Figure  1,  decreases  with  distance  from 
the  wall.  The  wall  correction  is  quite  small  compared  with 
the  the  slip-shear  interaction  force  at  moderate  distances 
from  the  wall  (  d  =  0(a  Re”*-'^')). 
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“Iloo  loo 

Figure  1 . 


ZOO  3J30  4JJfl  S.00 
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Dimensionless  wall -induced  force  versus 
dimensionless  distance  from  wall. 
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The  above  analysis  presumes  a  -force  F  on  the  sphere  in 
axial  direction-  If  no  force  acts  on  the  sphere,  the 
analysis  gives  a  force  of  ^ero  to  the  order  computed ,  since 
the  Stokeslet  contribution  in  the  inner  flow  is  zero-  With 
Fi  =  O  and  Ti  -  O,  the  next  contribution  to  the  force  is  the 
strainlet  (Schonberg,  Drew  and  Belfort  1985).  This  alters 
the  matching  condition  as  C  ^  eqs-  (17)  and  C13), 

and  leads  to  D(Re^^^)  corrections  (Lin,  Peery  and  Schowaiter 
1970)  - 
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THK  EFFKCTS  OF  NON-SPHERICITY  AND  RADIATIVE  ENERGY  LOSS  ON  THE 
MIGRATION  OF  THE  GAS  BUBBLE  FROM  UNDERWATER  EXPLOSIONS 

K.C*  Heaton 

Weapon  Systems  Section,  Armaments  Division 
Defence  Research  Establishment  Valcartier 
P.O.  Box  8800,  Courcelette 
Quebec,  GOA  IRO 


ABSTRACT 


One  of  the  more  important  phenomena  associated  with  the  upward 
motion  of  a  gas  bubble  from  an  underwater  explosion  is  the  significant 
departure  from  sphericity  near  the  times  of  the  mlnlraura  bubble  radius - 
Neglecting  this  change  in  shape  results  In  the  prediction  of  a  much 
faster  upward  velocity  than  actually  occurs*  The  inclusion  of  this 
effect  In  the  equations  of  motion  has  been  exceedingly  difficult  be¬ 
cause  of  the  large  magnitude  of  the  departure  from  sphericity* 

In  this  work,  the  shape  of  the  bubble  is  described  by  an  ellipsoid 
whose  axes  are  allowed  to  vary  Independently,  thus  modelling,  to  first 
order,  the  changes  of  bubble  shape*  The  Lagranglan  equations  of 
motion,  incorporating  the  effects  of  the  change  of  shape  and  of  energy 
loss  by  the  radiation  of  sound,  are  derived  and  solved  for  the  case  of 
a  spheroidal  bubble*  The  results  of  these  calculations  for  various 
Initial  conditions  are  compared  with  analgous  cases  for  a  spherical 
bubble* 

It  is  found  that  the  spheroidal  bubble  model  predicts  a  reduction 
In  the  upward  translational  motion  of  the  bubble  of  a  factor  of  approx¬ 
imately  2*  A  comparison  of  the  predicted  upward  motion  of  a  spheroidal 
bubble  produced  by  227*27  kg.  of  TNT  detonated  A6  metres  below  the 
surface  shows  good  agreement  with  that  which  has  actually  observed. 
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I.  INTRODUCTION 


The  formation  of  a  bubble  of  gaseous  detonation  products  always 
accompanies  an  underwater  explosion.  This  bubble  rises  toward  the 
surface  of  the  water,  responding  as  it  does  so  to  the  change  in  the 
external  pressure  distribution  with  oscillatory  motion,  during  the 
course  of  which  it  loses  some  of  its  energy  through  the  emission  of 
sonic  pulses.  Although  the  bubble  is  Initially  spherical,  the  effect 
of  its  upward  motion  is  to  distort  it  into  a  non-spherical  shape,  which 
becomes  most  pronounced  in  the  neighbourhood  of  the  minimum  radius. 

The  alteration  in  the  bubble’s  shape  further  affects  both  the  pulsa¬ 
tions  and  the  upward  translational  motion  of  the  bubble.  By  means  of 
finite  element  techniques,  the  equations  of  motion  of  the  bubble  can  be 
solved,  taking  into  account  the  effects  of  the  changing  shape  of  the 
bubble,  although  the  amount  of  computing  time  required  by  this  method 
limits  its  utility-  Finite  element  methods  have  a  further  disadvantage 
in  that  physical  insights  into  the  systems  considered  are  rather  more 
difficult  to  come  by  than  might  otherwise  have  been  the  case. 

Herring  (1942)  and  others  (eg-  Taylor  (1942)  and  Shiffraan  and 
Friedman  (1944))  have  treated  the  problem  of  the  motion  of  the  bubble 
by  considering  it  to  be  a  perfect  sphere  throughout  its  entire  motlon- 
This  treatment  yields  values  for  the  periods  of  radial  pulsations  of 
the  bubble  which  are  in  good  agreement  with  experimental  data,  but 
predicts  a  much  more  rapid  movement  toward  the  surface  than  Is  actually 
observed.  This  arises  because  the  largest  upward  velocities  of  the 
bubble  occur  at  those  times  when  the  bubble  is  near  its  minimum  radius; 
it  is  precisely  then  that  the  largest  departures  from  sphericity  occur - 
Penney  and  Price  (1942)  and  Ward  (1943)  included  the  effects  of  the 
non-sphericity  of  the  bubble  on  its  motion.  However,  it  was  always 
explicitly  assumed  in  their  derivations  of  the  velocity  potential  of 
the  flow  about  the  bubble  that  the  departures  from  sphericity  are 
always  small.  Accordingly,  their  equations  are  not  applicable  near  the 
times  at  which  the  bubble  is  at  its  smallest  volumes. 

Hicks  (1972)  was  able  to  bring  the  value  for  the  upward  transla¬ 
tional  velocity  of  a  spherical  bubble  at  a  minimum  radius  into  agree¬ 
ment  with  experimental  data  by  adding  a  drag  terra  to  the  equations  of 
motion.^  In  his  formulation,  the  drag  coefficient  Is  an  empirical 
correction  whose  value  Is  chosen  to  make  the  predicted  rate  of  rise  at 
the  first  minlraura  radius  consistent  with  observation.  However,  a 
different  drag  coefficient  must  be  selected  for  each  charge  mass  and 
depth,  requiring  a  comprehensive  data  base  from  which  the  appropriate 
value  can  be  chosen  for  each  case.  For  these  reasons,  a  model  for  the 
bubble  in  which  large  deviations  from  sphericity  and  their  effects  on 
its  translational  motion  are  treated  would  be  of  considerable  practical 
and  theoretical  interest. 
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Ill  this  work,  a  Lagrangian  is  derived  for  a  bubble  whose  shape  is 
not  constrained  to  be  always  spherical  but  may  become  ellipsoidal  as  it 
moves  upwards-  The  equations  of  motion  for  a  general  ellipsoidal 
bubble,  incorporating  the  effects  of  loss  of  energy  by  radiation,  are 
presented-  The  algorit’nin  by  which  these  equations  of  motion  were 
solved  numerically  is  briefly  discussed.  Computational  results  for 
some  charge  masses  and  depths  are  presented  and  compared  with 
experimental  data. 

II.  REVIEW  OF  PREVIOUS  WORK 


Taylor  (1942)  derived  equations  describing  the  motion  of  a 
spherical  bubble  of  gas  undergoing  both  radial  pulsations  and 
translational  motion  toward  the  water  surface.  These  are: 

2Tipa'^(^)  +  '^a^U^'  +  [2.l] 

-  -  E(a) 

U  =  -  ^  f  a^dt  [2.2  ] 

a  o 

where  a  is  the  radius  of  tVie  bubble  as  a  function  of  the  time  t,  U  its 
upward  velocity,  2  the  position  of  the  bubble  below  the  pressure  datum 
(i.e*  below  the  zero  pressure  level),  E(a)  the  internal  energy  of  the 
gas  comprising  the  bubble,  g  the  gravitational  acceleration,  the 
total  energy  of  the  bubble  and  p  the  density  of  the  water.  In  Taylor's 
formulation,  there  was  no  mechanism  included  for  energy  loss,  and  hence 
Yq  was  taken  to  be  a  constant.  For  TNT  explosions,  it  has  been  found 
(Herring  1942)  that  approximately  50%  of  the  total  explosion  energy  is 
retained  by  the  bubble;  in  that  case, 

Yq  =  (1.85  X  [2.3] 


where  is  measured  in  ergs,  and  M,  the  original  mass  of  the  explosive 
charge,  is  given  in  gra.  If  one  assumes  that  the  gaseous  explosion 
products  obey  the  ideal  gas  law,  then  the  internal  pressure,  P,  is 
given  by 


P  =  k(Pg)Y 


[2.  A] 


where  Pg  is  the  density  of  the  explosion  products  and  y  the  ratio  of 
specific  heats-  Assuming  that  the  entire  mass,  M,  of  the  explosive  has 
been  converted  to  gas. 


Pg  = 


/An>, 


[2.5] 
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and  hence. 


E(a)  =  7  PdV 
a 

(Y-1) 

where  dV  =  47ia'^d3,.  Taylor,  vising  the  work  of  Jones,  set 
k  =  7.83  X  10^' 


Y  =  1.25 


[2.6] 


[2.7] 


for  TNT  where,  in  eq.  [2*6],  E(a)  is  measured  in  ergs  and  m  in  gm. 

Hicks  (1972)  incorporated  a  drag  force,  F^,  into  the  equations  of 
motion,  where  is  given  by 


1  ?  7 

=  Y  Cj^upa^U 


[2.8] 


and  the  value  of  the  drag  coefficient,  C^,  was  chosen  to  be  C|^  -  2.25 
in  order  to  bring  the  distance  travelled  upward  by  the  bubble  at  its 
first  maximum  into  agreement  with  that  actually  observed  for  500  lbs. 
of  TNT  detonated  150  ft.  belov^  the  surface.  By  differentiating  eq* 
[2.2],  he  obtained  the  rate  of  change  of  momentum  with  respect  to  time. 
The  incorporation  of  F^^  into  the  equation  of  motion  yielded; 


■j^a^U)  =  2a^g  -  I*  Cjja^U^ 

L,  the  Lagrangian  of  the  bubble,  is  given  by 

L  =  ■*'  J  ~  ^a^Pgz 

-  E(a) 


[2.9] 


[2.10] 


Hence,  a  more  general  form  for  the  equations  of  motion  of  a 
spherical  bubble  is  given  by 

If  (§)-  it  ■  Qa. 


4 _ ^  —  -SL.  =  0 

^  az  ^ 


[2.11] 


where  a  =  z  =  ^  =  -U,  and  the  are  the  generalised  dissipative 

forces- 


ThG  dissipatl\7e  drag  force,  gWen  by  in  eq.  [2*8  ]•  It 

has  previously  been  shovni  (Heaton  1984)  that  the  generalised  force 

for  dissipation  by  radiation  of  sound,  is  given  by 

=  -I6iipa^  ^ 

where 


[2.13] 


The  terms  contained  in  AQ^  are  analogous  to  the  radiation  reaction 
terras  in  electromagnetic  theory,  and  hence  can  be  ignored  in  a  first 
approximation,  although  this  approach  will  underestimate  the  energy 
loss  near  the  miniraum  radius,  and  overestimate  It  elsewhere* 


AQ  =  4npa^[4aaa'+  (a“a  ^/a)] 


*2 

a  -  and  Cg  is  the  speed  of  sound  in  the  water, 

dt 


[2.12] 


Ward  (1943)  and  Penney  and  Price  (1942)  derived  equations  of 
motion  for  a  nearly  spherical  bubble  by  expanding  the  velocity  poten¬ 
tial,  5,  of  the  flow  about  the  bubble  in  terras  of  the  Legendre  poly¬ 
nomials,  thusly: 


r 


+  B, 


Ucos0) 


l2(cos9) 

7 


[2.14] 


where  the  coefficients  A,  and  B2  functions  only  of  time*  They 

further  assumed  that  the  radius  vector,  R(t),  from  the  centre  of  the 
bubble  to  a  point  on  its  surface  could  be  written  as 


R(t)  =  a  +  b2?2(cos6)  +  **-  [2*15] 

where  a,  b2,  and  bg  are  functions  of  time  only.  At  the  surface  of  the 
bubble. 


_  A 


+ 


(■^)  -  u  cose 

or  ^ 

B  B 

2  ^  Pj^(cosG)  +  3  — 


[2.16] 


Substituting  eq.  [2.15]  into  eq.  [2.16]  and  equating  the  coefficients 
of  the  Legendre  polynomials  on  both  sides  of  the  equation  yields: 


A  -  .,2  da 
A  -  a 


R  n  6  “2.,  _  1  ^3„ 


^  +  2  -5^2  -  3  ^ 
a  a 


db3 


A 

+  2  b3  =  4  ^ 


if  2i  b. 


[2.17] 
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At  the  surface  of  the  bubble,  tlie  pressure  must  be  uniform  and  equal  to 
the  Internal  gas  pressure.  Using  Bernoulli’s  equation,  this  condition 
can  be  written  as 

gz  -  gRcosS  +  (||)  -  =  ^pg)^  [2-18] 

R  ^ 

If  one  substitutes  for  $  and  R  in  eq.  [2.18],  using  eqs-  [2.14]  ^ 
[2*15],  multiplies  the  resulting  equation  in  turn  by  each  of  the 
Legendre  polynomials,  and  then  integrates  over  cosG,  the  orthogonality 
relations  among  the  Legendre  polynomials  produce  4  differential  equa¬ 
tions  : 


g2  +  a(^)  +  i  ^  +  OCU*)  " 

^  (a^U)  -  ^  ^  ^  dt  +  O(U^)  =  ga^, 

dBo  3Bo  00  j  d^ a  V 

"  J 


1  ^^2  ^*^2  da  .  3  „2 

a  a 


3B. 

dt 

o  a^ 


+  O(U^)  =  0, 

1  ^^3  da  1  d^a  ^ 

TT  It-  "  3-  ITF  ■  T-2-  J  TT"  ) 

a  rl  r  n 


=  0 


[2.19] 


Ward  (1943)  has  estimated  the  value  of  b2,  which  measures  the  depar¬ 
tures  from  sphericity,  in  eq-  [2.15],  and  found  that  it  remains  small 
until  the  bubble  begins  to  contract.  Near  the  bubble’s  minimum  radius, 
b2  becomes  greater  than  a,  making  the  whole  calculation  1 aval id - 

III.  EQUATIONS  OF  MOTION  FOR  AN  ELLIPSOIDAL  BUBBLE 


Now,  let  a,  b,  c  be  the  semi-axes  of  an  ellipsoid  along  the  x,  y, 
z  axes,  respectively,  for  a  co-ordinate  system  whose  origin  is  at  the 
centre  of  the  ellipsoid*  Let  the  ellipsoid  be  immersed  in  a  fluid  of 
infinite  extent,  and  let  one  of  the  axes,  say  a,  vary  with  respect  to 
time.  At  any  instant  in  time,  the  equation  of  the  ellipsoid  will  be 
given  by 


[3.1] 


The  velocity  potential,  for  the  flow  about  the  ellipsoid  is 
given  by  the  solution  to  Laplace’s  equation, 

=0  [3.2] 

with  appropriate  boundary  conditions.  Since  the  problera  obviously 
possesses  ellipsoidal  symmetry,  it  Is  most  convenient  to  transform  to 
ellipsoidal  co-ordinates,  thusly: 
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[3.3] 


^2  .  IaVx)Ca^+|iKa.^t^) 

v2  =  (blty.(b^+lAl(b^+vl 
^  (bO)(b2-a2)  ’ 

(c^-a^Xc^-b^) 

where  X,  [i,  v,  are  the  ellipsoidal  co-ordinates.  The  surfaces  defined 
by  X  constant,  \x  -  constant,  v  =  constant,  are  confocal  quadrics- 
Because  of  the  symmetry  which  exists  in  the  transformation  equations, 
eq*  [3-3],  one  is  allowed  to  specify  which  co-ordinate's  constancy  will 
yield  confocal  ellipsoids-  Throughout  this  paper,  then,  the  relation  X 
=  constant  will  be  taken  to  describe  the  family  of  confocal  ellipsoids- 
In  ellipsoidal  co-ordinates,  Laplace's  equation,  eq-  [3.2],  is  given 
by: 


+  ®  - » 

where  is  given  by 

kx  “  ((a^+\)(b^+X)(c2+\)) 


[3.4] 


[3.5] 


and  the  expressions  for  and  k^  can  be  obtained  from  eq.  [3.5]  by 
symmetry. 

Now,  let  a  be  a  solution  to  eq-  [3.4],  and  let  another  solution 
be  given  by 


®  =  axW 


[3.6] 


where  x  fs  a  function  of  X  only.  By  substituting  eq,  [3.6]  into  eq. 
[3.4],  one  finds  that  a  must  have  the  form 

a  ^  a.f(p,v)  [3-7] 


where  is  a  fvinction  of  X,  only  (Milne-Thompson  1949)-  Using  eq. 
[3.6]  to  aid  in  the  solution  of  eq-  [3.4],  one  finds  that 


x(^)  =  A  /  +  B 


[3.8] 


where  A  and  B  are  arbitrary  constants.  Hence,  if  a  is  a  solution  to 
eq.  [3.2], 


$  =  a  / 


[3.9] 
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is  also  a  solution.  The  solutions  to  eq.  [3.4]  having  the  form  of  eq* 
[3.9]  are  the  ellipsoidal  harmonics. 

In  the  case  of  the  spherical  bubble,  Taylor  and  Herring  assumed 
that  its  pulsations  would  be  described  by  simple  radial  oscillations. 
Since  a  sphere  Is  a  degenerate  ellipsoid,  it  would  be  reasonable  to 
choose  as  a  solution  to  eq.  [3*4]  the  ellipsoidal  harmonic  which  pro¬ 
duces  analogous  oscillations.  Such  a  velocity  potential  is; 


[3.10] 


since  ”  1  is  a  solution  to  eq.  [3.4].  The  upper  limit  of  the  inte¬ 
gral  has  been  chosen  in  order  that  the  potential  become  0  at  an 
infinite  distance  from  Its  source.  The  constant  of  integration  A  is 
determined  by  the  boundary  conditions.  Now,  the  boundary  conditions 
for  a  pulsating  ellipsoid  are  not  as  obvious  as  those  for  a  sphere. 
Nonetheless,  it  seems  apparent  that  at  that  point  on  an  axis  which  is 
on  the  surface  of  the  ellilpsold,  the  normal  velocity  of  the  fluid  must 
be  equal  to  the  rate  of  change  with  respect  to  time  of  that  axis. 


Hence,  for  an  ellipsoid  inwhichonly  one  axis  is  allowed  to  vary, 


say  a, 


[3.U] 


where  a  4^,  is  the  normal  derivative  of  the  potential,  and 

n  is  the  unit  outward  normal  to  the  ellipsoid*  In  ellipsoidal 
coordinates,  eq.  [3.11  ]  can  be  written  as: 


2  abc  A  ^ 


[3.12] 


where  the  potential  $  Is  given  by  eq.  [3.10 ]. 


Now,  at  x=a,  y“o,  z=o 


[3.13] 


[3.14] 
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Hence , 


A 


y  abc 


[3. 15] 


and 


The  kinetic  energy  of  the  fluid  around  the  ellipsoid  is  given  by 

T  ~  T  P  #  6?  [3-17] 


where  the  integral  is  carried  out  over  a  bounding  ellipsoid  described 
by  eq-  [3,l],  and  over  a  second  outer  boundary  which  is  obtained  by 
allowing  X  in  eq.  [3.3]  to  approach  infinity,  essentially  describing  an 
infinitely  extended  ellipsoid.  The  contribution  to  eq.  [3.17]  from  the 
large  ellipsoid  vanishes,  leaving  only  that  at  the  inner  surface  i.e. 
the  bubble  itself*  The  surface  integral  can  be  transformed  into  one 
over  the  x-y  plane,  thusly: 


ds  .  //  dy 


[3.18] 


where  z  is  the  unit  normal  along  the  z  axis*  The  unit  normal,  n,  to 
tlie  surface  of  the  ellipsoid  is  given  by 


A 

11  - 


2  2  2 


^  J.  .Y_  ^  J-  z 

r  (^7  ^  y  +  z) 


[3.19] 


in  the  ellipsoidal  co-ordinates, 

_  1  a<C'^ 


^'1 


where 


,2  _  1 


■  (7:1 


X 


(b'+X)^  (c+X)" 


(a^+X)- 

So,  at  the  surface  of  the  ellipsoid,  X=o, 


) 


[3.20] 


[3.21] 


M 

bn 


2  2 
a  D 


+  V 

c 


[3.22] 


Using  eqs.  [3.16],  [3.18],  [3.19],  and  [3.22]  and  taking  account  of  the 
contributions  from  the  half  surfaces  above  and  below  the  x-y  plane,  eq* 
[3.17  ]  becomes 
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[3.23] 


.2 

T  =  1-  .0- 

'■  2  a 


where 

z  =  c(l-  ^  [3.24] 

Evaluating  eq.  [3-23]  finally  yields  the  expression  for  the  kinetic 
energy,  T,  of  the  flow  about  an  ellipsoid,  described  by  eq.  [3*l],  when 
the  semi- axis  a  is  allowed  to  vary  with  respect  to  time; 

T  =  Tipa^b^c^  J  ^  [3.25] 

O  L  J 

Since  the  equations  are  symmetrical  with  respect  to  all  three  semi- axes, 
it  is  possible  to  obtain  expressions  for  the  kinetic  energy  of  the  flow 
about  an  ellipsoid  when  each  of  the  other  semi-axes  are  varying  by  means 

,  and  c,  thusly : 

[3.26] 

[3.27] 


Hence,  by  adding  eqs.  [3.25]  -  [3*27],  and  renormalizing,  one  can 
obtain  the  kinetic  energy,  T,  for  the  flow  around  an  ellipsoid  when  all 
three  axes  are  allowed  to  vary  in  time: 


of  cyclic  permutations  of  a, 

T  =  iipafi“c^  / 

o  \ 

T  =  .pa2b2^2  J  ^ 

where  6  =  4r  arid  c  =  4—  . 


+  a2i;2,2 

+  a  b  c 


[3.28] 


The  normalisation  factor  3  has  been  chosen  so  that  eq.  [3.28]  is  in 
agreement  with  the  term  for  the  kinetic  energy  due  to  radial  spherical 
pulsations  in  eq.  [2.l]  when  a=b=c  and  a“6=c. 


A,s  mentioned  above,  it  is  not  entirely  clear  what  sort  of  boundary 
conditions  are  applicable  at  the  surface  of  a  pulsating  ellipsoid-  In 
fact,  the  specification  of  boundary  conditions  when  all  three  semi-axes 
are  varying  is  equivalent  to  putting  constraints  on  the  interactions 
among  a,  and  c,  which,  in  turn,  is  equivalent  to  specifying  which 

shapes  the  bubble  will  be  allowed  to  assume.  Equation  [3.28]  corre¬ 
sponds  to  a  velocity  potential  which  has  been  so  constructed  that  the 
velocity  cross-terms  in  eq-  [3*17]  cancel  out.  Since  a,  ft,  and  c,  are 
all  mutually  perpendicular,  this  seems  physically  reasonable.  This  has 
the  effect  of  insisting  that  the  movement  of  a  point  on  an  axis  of  the 
ellipsoid  is  due  only  to  the  change  in  length  of  that  axis,  with  the 
changes  in  the  lengths  of  the  other  two  axes  contributing  nothing - 
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The  velocity  potential  associated  with  the  purely  translational 
motion  of  the  ellipsoid  along  the  z  axis  is  well  known  (e.g*  Milne- 
Thompson  1949)*  The  boundary  condition  is 

"  In  \  “  Ucos©  [3.29] 

\-o 

wViere,  as  before,  -U  =  ^  "  Zj  angle  between  the  z  axis  and 

the  normal  to  the  surface  of  the  ellipsoid,  and  z  is  the  position  of  the 
bubble  centre  below  the  pressure  datum-  Since 

eq.  [3.29]  becomes 


m 


5\ 


) 

\=o 


-■'if) 


X=o 


[3.31  ] 


The  solution  to  eq.  [3.4]  which  satisfies  the  boundary  conditions 
in  eq.  [3.31  ]  is  the  first  order  ellipsoidal  harmonic  given  by 


$  = 


[3.32] 


where  C  is  a  constant  of  Integration.  Direct  substitution  of  eq.  [3.32] 
back  into  eq.  [3.31  ]  yields 


C 


abc  U 
2-“o 


[3.33] 


where 


a 


o 


abc 


7 

o  (c^+\)k^ 


[3.34] 


The  kinetic  energy  of  the  flow  around  a  translating  ellipsoid  is  then 
given  by: 

T  =  -  7  p  //  ^  |~  dS 

“  2(2'-'a'g)  P'^^  ^  cosG^dS  [3.35] 


=  ^  abcp 

As  before,  the  integration  is  carried  out  over  a  bounding  ellipsoid 
whose  semi-axes  are  a,  b,  and  c,  and  over  one  whose  semi-axes  are 
allowed  to  extend  to  infinity,  where  the  contribution  from  the  outer 
ellipsoid  vanishes*  Hence,  the  kinetic  energy  for  the  flow  generated  by 
a  translating  ellipsoid  whose  semi-axes  are  varying  in  time  is  given 
by : 
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[3.36] 


I  .  ^  ^  .,2,2.2,  J  « 

o  A 


+  ^  abcp(j2— 


Equation  [3.. 36]  has  no  provision  for  the  Lateraction  of  a,  6*  or  c  with 
U*  Again,  this  is  not  unexpected  since  eq-  [3-35]  should  yield  the 
kinetic  energy  terms  in  eq.  [2.1],  as  it  does  in  fact  do,  when  a=b=c  and 
a=D=c-  Physically,  as  long  as  the  bubble  is  symmetrical  with  respect  to 
the  x-y  plane,  the  contributions  from  tlie  top  and  bottom  halves  to  a 
coupling  of  the  oscillatory  motion  with  the  translational  exactly  cancel 
each  other. 


The  energy,  V^,  associated  with  the  hydrostatic  pressure  around 
the  bubble  is; 


V  =  ^  pgabcz 


The  internal  energy  of  the  bubble,  E(a,b,c),  is  given  by 


E(a,b,c) 


[3.37] 


[3.38] 


Hence,  the  Lagrangian,  L,  for  the  flow  around  an  ellipsoidal 
bubble  whose  semi-^axes  are  a,  b,  c,  at  a  depth  z  below  the  pressure 
datura,  where  the  z  axis  is  parallel  to  the  ellipsoid  axis  c,  and  which 
is  moving  with  a  translational  velocity  U  =  -  ^  is  given  by 


L  .  f  pUhV  +  3^62^2  ,  ^2,2^2,  I  ^ 


+  ^  pabc  ~  ^  pabcgz 


.  kMYa~(^~^)b~Crl)c.-(Y~l) 


[3.39] 


The  total  energy  of  the  bubble,  Y(t),  at  any  time  t  is  therefore 
given  by: 


Y(t)  =  4  p(a2b2c2  +  ]  di 

o  \ 

2 

+  pabcCy-g— ^  pabcgz  [3.40 ] 

+  kMVa~^Y~nh-(Y-l)c-(Y-U 
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Hence,  t)ie  equations  of  motion  for  the  bubble  are: 


It  aa 

d_  _  Ml  =  Q 

dt  ab 

d _  Mi  =  0 

dt  5c  ^c* 

d  /Mi\  _  Mi  =  0 
^  oz  * 

where  the  are  the  generalised  dissipative  forces* 

The  substitution  of  eq.  [3.39]  into  eq.  [3.41  ]  results  in: 


pb^c^I^a  =  ~  ^  P  [^ 


2-k 


1  2  a' 


aa 


+  (ac^bl^  +  b^c^  +  (2b2cl^  +  b^c^  ^)ac] 


+  ^  p(a2b2e2  -b  a^b^c^)  ^ 

+  ^  p(ac^!>^  +  ab^c^)!^  +  ^  phcf{a^)z 

2u  ,  2if(a  ),2  ^ 

4-  ^  pabc  z - f  pgbcz 

+  (y-l)  Ka“''^b“^  +  Q.^, 


.2i.2»2x  ^^o 


^  ^^2,2  *•  =  _  2.  p^,2,2  ^ 


^  pa  c 


4  {Zach^  4  a^c^  §^)afi  4  (2a2cl„  4  a^c^ 


2  2 


4  f  p(aVc2  4  a2s2,2  ^  ,2,2-2)  ^ 


+  p(bc  ^  ba'^c^)!^  +  — j  pacf(aQ)z 

2%  V  4ii 

4  — j  pabc  — —  z - j  pganz 

4  (Y-l)Ka“^''^“^h^'Y)c~^l'~^^  4  Q,, 


[3.41] 


[3.42] 


[3.43] 


ilN'C  i-AS S  I.  i. 
13 


.,51 


+  (2^b\  +  +  (2a2bl  +  ^)62] 


+  f-  paw  ^  +  a^-b^c^)  ^ 

+  p(a^b^c  +  a^6^c)lQ  +  pabt‘(a|^)z 
^  2ti  u  *2  Are 

+  ~j  pabc  -  z - j  pgabz 

+  (Y-l)Ka”('Y-l)b“(Y"^)c"^  +  Q  , 


2^2.2  ^  .,2,  2.2, 


51, 


[3.AAJ 


da„ 


.*. •_  /a  ,  6  .  c,?  2  ““-0  • 

-  (a  +  -^  +  c)z  -  O'2-aJ  ^  ^ 


f(V 


+  Q^. 


whsre! 


a  = 


=  4_§.  b  =  ^ 


dt" 


dt^ 


c  = 


_  d^c 


dt 


*•  _  d^z 


I  =  f  ^ 

o  h* 

f(“o)  =  17^* 


££o  -  ^  I  6  +  Ho  . 

dt  aa  ®  ^  &b  ^  ^  ac 


df(ao)  df(a^),  ,  af(a^)^.  ,  afCa^). 

dt  aa  ®  ■'■  ^b  ^  ■'■ 

K  - 


[3.45] 


[3.46] 


In  order  to  complete  the  derivation  of  the  equations  of  motion  of 
the  bubble,  it  is  necessary  to  determine  Q^,  Q^,  and  Q^,  the  general^ 
ised  dissipative  forces  associated  with  the  radiation,  of  sound  by  the 
bubble.  Now,  at  distances  large  compared  with  the  scale  of  the  bubble, 
the  form  of  the  velocity  potential,  In  the  fluid  will  be  identical 
with  that  of  a  spherical  bubble.  Hence, 

$  “  -  7  +  ^  •  ^(l/r)  [3-47] 
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where  C  and  A  are  consUanls  which  depend  only  on  the  time  r  is  the 

distance  to  the  field  point:  from  an  origin  located  somewhere  within  the 
bubble- 

Followlng  the  development  given  by  Landau  and  Lifshitz  (1966),  in 
the  wave  zone, 

$  =  C^)  +  ^.(A(t.)  9)  r3  4gj 

where  0  Is  the  angle  between  the  direction  of  the  translational  motion 
and  r,  and  the  retarded  time  t’  is  given  by 

f  =  t  -  ^  ■  [3.49] 

where  Cg  is,  as  before,  tVie  velocity  of  sound  in  water.  The  velocity, 
\^,  of  the  water  in  the  wave  zone  must  therefore  be  given  by: 

V  -  - 

«  (I  QC(t) 
bt 


cosQ  b  A(t)^ 
S 


[3.50] 


+. .  ■ 


where  terms  of  higher  negative  order  in  r  have  been  neglected.  The 

total  energy  emitted  as  sonic  radiation  per  unit  time,  ~  is  then: 

d  t 

^  =  -  PCg  //  (^•^)dS 

2 

=  _  A.itft.  _  Am 

Cc  5t  3q3  ^^2 


at" 


[3.51] 


where  the  integral  has  been  taken  over  a  sphere  of  radius  r  (Landau  and 
Lifshitz  1966). 

_3  To  a  good  approximation,  the  term  in  eq.  [3.51  ]  proportional  to 
C  can  be  neglected  for  low  translational  velocities,  since  it  will  be 

^  _ 1 

2  orders  of  magnitude  smaller  than  that  proportional  to  Cg  ^ . 


Now,  the  volume  AuC  of  fluid  which  flows  through  the  surface  over  which 
the  integral  in  eq.  [3.51  ]  is  taken  must  be  equal  to  the  rate  of  change 
with  respect  to  time  of  the  volume,  of  the  bubble-  Thus, 


C  = 


I 


(abc  +  abc  +  abc) 


[3,52] 
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and  so 


+  abc  +  abc 

+  a(  ftc  -H  be) 

+  t(ac  +  ac) 

2 

-H  c(ab  +  afi)  ] 


Now,  evidently, 


=  aQ^  +  E.q^  +  cQ„  -!■ 


[3.54] 


Since  all  of  the  dependence  on  the  translational  velocity  in  eq-  [3-5ll 
was  contained  in  the  term  proportional  to  Cg  ,  it  follows  that,  to  the 
same  approximation,  =0  in  eq*  [3,54].  Since,  in  the  absence  of  any 

translational  motion, "there  exists  nothing  to  distinguish  one  axis  of 
the  ellipsoid  from  another,  it  follows  that  one  should  be  able  to  obtain 
the  other  two  from  one  by  cyclic  permutation  of  the  axes  a,  b,  and  c. 
The  only  grouping  of  the  terras  in  eq*  !  3*53]  which  is  invariant  under 
cyclic  permutation  of  the  axes  is  given  by 


dE  ^  471 


p  [F(a)  I  F(ah) 
3  i"I 


+  F(b) 


b)  I  FCaTp  +  F(c)  I  F(a*.)] 
1=1  1-1 


[3.55] 


where 


FCa'i)  = 

a.  a^a., 

J 

a nd  a^  —  a,  ^2  ”  ^3  ” 

Hence,  by  comparison  of  eq.  [3.55]  with  eq.  [3*54], 


[3.56] 


[3.57] 


where,  as  before,  i,  j,  k,  are  succe.ssively  equal  to  1,  2,  3*  Now,  as 
was  the  case  for  the  spherical  bubble,  the  terms  in  eq.  [3.57]  which 
depend  upon  the  products  of  the  pulsational  accelerations  with  them^’ 
selves  or  with  the  pulsational  velocities  are  analogous  to  the  radiation 
reaction  terms  in  electromagnetic  theory.  This  suggests  that  such  terms 
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may  be  igaorable,  at:  least:  In  a  first  approximation-  At  this  stage.  In 
the  absence  of  any  perturbing  force  in  the  x-y  plane,  it  is  possible  to 
allow  t  =  a-  ThisS  allows  one  to  drop  eq*  [3,42j  as  an  equation  of 
motion,  and  replace  t  and  b  by  a  and  a  In  eqs-  [3.43]  ^  [3.45]  and  eq. 
[3-57].  This,  of  course,  specialises  the  equations  of  motion  to  those 
of  a  spheroidal  bubble.  Henceforth,  throughout  this  paper,  the  case  of 
the  spheroidal  bubble  will  be  treated  exclusively. 


To  sum  up:  eqs.  [3.42]  -  [3-45]  are  equations  of  motion  de¬ 
scribing  a  pulsating  ellipsoidal  bubble  undergoing  translational  motion. 
When  ”  Qb  ^  equations  neglect  any  sort  of  energy  loss. 

It  has  been  shown  that  the  energy  loss  from  the  translational  motion  of 
the  bubble  can  be  expected  to  be  negligible  witVi  respect  to  that  from 
the  pulsational  motion  and  so  was  set  to  0  in  eq.  [3-45 J.  It  was 
further  shown  that 

[3.58 

where 


] 


F(a^)  = 

+  AF(a»), 

When  one  wishes  to  ignore  the  effects  of  radiation  reaction, 
AF(aT)  =  0 


and 


[3. 60] 


AFCir)  =  ^rajaj^,  i^j^k 


[3.61] 


when  one  wishes  to  include  them. 


IV.  NUMERICAL  METHODS  OF  SOLUTION 


Before  one  attempts  numerical  solutions  of  the  equations  of 
motion,  eqs.  [3.43]  —  [3.45],  it  is  useful  to  make  them 
non-dimensional -  Thus,  the  substitution  of 

* 

a  =  a  L, 
c  =■  c*L, 
z  =  z  L, 
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Into  the  eqaatioas  of  mot  Lou  used,  where 


1 


yields  a  dimensionless  form  of  the  equations  of  motion.  As  before, 
is  given  by  eq.  [2.s^]  and  g  and  p  are,  respectively,  the  gravitational 
acceleration  and  the  density  of  water.  These  particular  scaling  factors 
in  eq.  [^-2]  were  originally  used  by  Taylor  (1942). 


Slnce  all  of  the  equations  of  motion  have  the  unfortunate  property 
of  singularity  at  the  origin,  it  is  necessary  to  begin  the  integration 
with  a  series  solution.  Taylor  (1942)  suggested  that  the  initial  solu¬ 
tions  to  the  dimensionless  forms  of  eqs.  [2*1  ]  -  [2-2]  be 


it 


a 


2/5 

h.0025^  » 


mic 

z 


it 


z 


where  z*  is  the^initial  dimensionless  depth  below  the  pressure  datum, 
for  values  of  t  near  zero.  Since  the  bubble  can^be  expected  to  be 
spherical  initially,  the  values  for  a  ,  and  z  from  eq.  [4.3]  were 

used  to  taeg^n  thg  integration  at  time  t^,  with  the  additional  require¬ 
ment  that^ a  =  c^.  One  also  needs  initial  values  for  the  rates  of 
change,  a'  and  c  ,  of  the  semi-axes.  These  were  estimated  by  assuming 
that  the  bubble  would  be  initially  spherical  and  substituting  into  the 
diraen^ionles^  form  of  eq.  [2.1  ]  to  find  a  *  Hence,  the  initial  values 
for  a  and  c  are  given  by 


a 


.  (1-  E^))-l 


2ita 


fZ  .  2  * 

-(F-)  -  » 


[4.4] 


*  i: 

a  =  C  , 


where 


E  (a  )  = 


_ 


[4.5] 


Y^  is  the  initial  total  energy,  as  given  by  eq-  [2.3],  and  all  other 
variables  are  as  previously  defined - 


51 


o 


da  » 


Another  uumerLcal  difficulty  concerns 
57  5a  5a 

evidently. 


the  evaluation  of  the  terms 
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[4.6] 
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CD 
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Q 
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2 
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(c^+X) 

5  a 
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0  ^ 

■5H“  - 

bclj^ 

+  abc  Qg  , 

90o 

51, 

U  _ 

■SE"  - 

ablj^ 

+  abc  ^ 

where 


h  -  I  - r  ■  - T77 

O  2  O  2  'y  Of  L 

(a'^+X)  (b'^-HX)  (c'^+X) 


[4.8] 


and  I|  were  evaluated  with  the  IMSL  double  precision  subroutines 
MMLINF  and  MMLIND,  which  compute  incomplete  elliptic  Integrals  of  the 
first  and  second  kind,  respectively-  The  partial  derivatives  of  1^,  eq. 
[4.6],  are,  in  fact,  incomplete  elliptic  integrals  of  the  second  kind, 
and  can  be  evaluated  with  MMLIND,  The  evaluation  of  the  second  terms  in 
eq-  [4-7]  presented  considerable  difficulty.  In  point  of  fact,  no 
commercial  routine  capable  of  evaluating  eqs -  [4-7]  seems  to  exist,  and 
the  difficulties  involved  in  the  composition  of  one  ab  nihilo  are 
formidable*  Accordingly,  as  a  stopgap,  the  terms  in  eqs-  [4*7]  which 
involve  the  derivative  of  an  incomplete  elliptic  integral  of  the  second 
kind  were  evaluated  by  holding  one  of  a  or  c  constant,  and  varying  the 
other  at  each  step  in  the  integration,  thusly: 

"Sa”  ""  T5a  (li(^Aa,c)  -  (I^  (a-Aa ,  c)  )  , 
bl.  1 

^  crhAc)  “  c-Ac)) 


where  MMLIND  was  used  to  evaluate  I^*  Since  Aa  and  Ac  can  be  made  as 
small  as  desired,  theoretically  eq-  [4-9]  can  be  made  to  approximate  the 
true  value  of  the  derivative  as  closely  as  desired;  however,  the  prac¬ 
tical  constraints  of  computational  time,  machine  accuracy,  and  the 
accuracy  of  the  IMSL  subroutines  do  place  limits  on  the  size  of  Aa  and 
Ac- 


The  actual  integrations  were  carried  out  using  a  4  point  Runge- 
Kutta  algorithm  incorporating  automatic  error  controls-  Some  numerical 
difficulties  with  this  method  were  encountered  when  the  radiation  re¬ 
action  was  incorporated  using  eq-  [3-61  ].  Near  the  minima  of  a  and  c,  a 
and  c  become  small,  and  change  sign  as  well-  Because  of  their  depend- 


•'1  j  -^1 

ence  upon  a  and  c 


the  values  of  and  can  oscillate  rapidly. 


adversely  affecting  the  convergence  of  the  integration-  This  difficulty 
was  circumvented  by  the  use  of  a  series  approximation  in  which  a  ^  and 
c  were  replaced  by  averaged  pulsational  velocities.  By  using  eq. 
[3.40],  it  is  possible  to  write 
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[4.10" 


(T^)i* 


(XUX-z. 

'•  Y 


A  ^  A 

-2a  b  e 


4a*b 


T*f  *  * 

c  z 


where 


^  =  h  (a+W-c)'-^ 


JL_  (.*2b*2^*2  /2£*2^*2 


[4.11] 


*2  *2.^2 

+  a  b  c  ) 

and  Y(t)  is  the  energy  of  the  bubble  at  any  time  t,  given  by  eq- 
[3.40]. 

If  one  defines: 


a  “ 


n  (a  ^)I  , 


*  *  * 

2a  b  c 

(a  ^)I  , 


,  *  *  *  A 

4  a  b  c  z 


[‘•12] 


then , 


where  the  positive  value  is  taken  while  a  particular  axis  is  expanding 
and  the  negative  while  it  is  contracting-  By  expanding  the  dimension^ 
less  form  of  eq.  [3*58]  when  AF(a*)  is  given  by  eq*  [3.6l],  and  sub- 

stituting  (a  )  for  (a  )  and  (c  )  ^,  values  for  the  dissipative 
function  incorporating  averaged  radiation  reaction  terms  were  obtained • 


Estimates  for  a  and  c  were  arrived  at  by  the  substitution  of  the 
current  values  of  a,  c,  a,  c,  z,  and  z  into  eqs*  [3.43]  and  [3.44]  with 
and  set  to  zero-  Those  values  for  a  and  c  were  substituted  back 
into  eq*  [3*58],  to  obtain  new  values  for  Q^and  which  were  in  turn 
used  in  eqs.  [3.43]  and  [3.44]  to  obtain  new  estimates  for  a  and  c- 

V-  NUMERICAL  RESULTS  AND  ANALYSIS 


Figures  2—11  show  the  results  of  computations  for  a  bubble 
produced  by  the  detonation  of  2-1136  kg-  of  TNT  6-1  metres  below  the 
surface,  using  eqs-  [3.43]  -  [3.46]  for  a  spheroidal  bubble-  Those 
curves  associated  with  a  spheroidal  bubble  and  labelled  *AY  =  0'  were 
calculated  under  the  assumption  of  no  energy  loss;  that  is,  Qa  ^ 

-  0  in  eqs*  [3-42]  -  [3.46].  The  curves  labelled  ’AY  0,  AF  =  O’,  were 
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calculated  incorporating  radiativ/e  energy  loss,  but  not  radiation 
reaction  terms;  that  is,  eqs •  [3*58]  -  [3.60]  were  used  to  define  and 
Q^.  In  Figs.  2-5,  a  and  c  are  the  semi-axes  of  the  spheroidal  bubble, 
where  a  is  the  semi-axis  in  the  plane  normal  to  the  bubble’s  upward 
motion,  and  c  the  semi-axis  in  tlte  plane  parallel  to  the  bubble’s  upward 
motion. 

Taylor  (1942)  considered ' the  same  case,  using  eqs.  [2.1  ]  -  [2-2] 
for  a  spherical  bubble  in  the  absence  of  any  energy  loss  from  any 
source.  In  Figs.  2-11,  the  curves  labelled  with  ’AY  =  0’  were  obtained 
by  solving  eqs.  [2,10]  -  [2.11  ],  with  which  are  equivalent 

to  Taylor’s  equations  for  a  spherical  bubble.  The  curves  labelled  ’AY  ^ 
0,  AQ^  "=  0’  result  from  the  incorporation  of  radiative  energy  loss, 
neglecting  radiation  reaction,  into  eqs.  [2.10]  -  [2.11  ];  that  is; 
was  given  by  eq*  [2.12],  with  AQ^  -  0.  The  curves  labelled  ’AY  ^  0, 

AQ^  incorporate  energy  loss  including  radiation  reaction;  that  is; 

and  AQ^  were  given  by  eqs.  [2.12]  -  [2.13].  In  Figs.  2-5,  ’radius’ 
refers  to  the  spherical  bubble  radius  as  calculated  in  eqs.  [2-10]  - 
[2,11]. 


The  curves  labelled  ’spherical’  in  Figs.  6-11  are  the  upward 
velocities  and  heights  above  the  original  detonation  point,  obtained  by 
solving  eqs.  [2.10  ]  -  [2.11  ]  for  a  spherical  bubble,  under  different 
assumptions  about  the  nature  of  the  energy  loss*  The  curves  labelled 
’spheroidal’  are  the  same  quantities  obtained  from  the  solution  of  eqs. 
[3.43]  -  [3.45]  for  a  spheroidal  bubble. 

Taylor  (1943)  presented  photographs  showing  the  behaviour  of 
bubbles  generated  by  electrical  discharges  in  oil,  which  are  here 
reproduced  in  Figs.  12-13-  These  show  that  a  bubble  in  the  early  stages 
of  its  motion  is  very  nearly  spherical,  but  that  near  its  minimum 
volume,  it  becomes  approximately  disc  shaped,  with  its  longest  dimension 
lying  in  the  plane  normal  to  the  direction  of  its  upv/ard  motion.  Near 
the  second  maximum,  the  bubble  is  highly  non- spherical,  and,  in  fact, 
seems  to  be  attempting  to  fission,  exhibiting  an  extremely  large  bulge 
on  Its  upper  surface  and  a  flat  lower  surface.  After  the  bubble  has 
passed  through  its  second  maximum,  it  becomes  mushroom-shaped  and 
actually  does  bifurcate  at  its  second  minimum.  The  two  halves  rejoin 
later  to  form  a  distorted  disc. 


As  one  can  see  from  Fig.  2,  a  spheroidal  bubble  reproduces  the 
salient  features  of  the  observed  behaviour,  at  least  qualitatively,  a 
and  c  were  very  nearly  equal  to  each  other,  as  well  as  to  the  spherical 
radius,  at  the  first  maximum.  Near  the  first  minimum,  the  bubble  became 
more  obviously  spheroidal,  with  the  ratio  a/c  assuming  a  value  of  2.34. 
After  the  first  minimum,  the  qualitative  agreement  between  the  sphe¬ 
roidal  bubble  model  and  Taylor’s  photographs  was  less  pronounced.  This 
was  hardly  surprising,  given  that  the  equations  of  motion  constrain  the 


373 


possible  shapes  of  the  bubble  to  spheroids.  llowever,  the  model  does  at 
lea^st  predict  that  the  bubble  would  not  return  to  a  spherical  shape.  In 
addition,  the  spheroidal  model  also  retains  one  of  the  successful 
features  of  the  spherical  model,  that  of  the  prediction  of  the  period  of 
the  bubble’s  oscillation*  The  period  of  the  first  oscillation  In  Fig.  2 
(which  was  defined  for  both  the  spherical  and  spheroidal  bubble  to  be 
the  time  after  a  maximum  at  which  the  first  derivative  of  the  volume 
changed  sign)  was  *26  seconds  for  both  models.  It  should  also  be  noted 
that  the  time  at  which  the  volume  was  a  minimiim  did  not  coincide  with 
the  minima  of  either  a  or  c*  In  Fig.  2,  a  reached  its  first  minimum  a 
full  .01  seconds  before  c  did* 

Figures  3-4  show  the  effects  of  incorporating  radiative  energy- 
loss  into  the  spheroidal  bubble  model,  and  compare  the  results  with  the 
analogous  case  for  a  spherical  bubble.  I^en  radiative  energy  loss  was 
included  in  the  calculations,  the  second  minima  of  a  and  c  occurred 
slightly  earlier,  and  was  more  nearly  coincident.  Figure  5  compares  the 
semi- axes  for  a  spheroidal  bubble  under  various  assumptions  about  the 
form  of  the  energy  loss.  It  is  interesting  that  the  most  noticeable 
difference  among  the  calculations  was  in  the  value  of  c. 

The  outstanding  failure  of  the  spherical  model  for  the  bubble  is 
its  prediction  of  a  too  rapid  rate  of  rise  wlien  the  babble’s  volume  is  a 
ralniniura.  As  Figs.  6-10  show,  the  maximum  upward  velocity  predicted  by 
the  spheroidal  model  was  less  than  that  predicted  by  the  spherical  model 
by  a  factor  of  2,  and  consequently  the  distance  travelled  from  the  site 
of  the  explosion  was  decreased  by  about  the  same  amount*  This  diminua- 
tion  of  the  upward  translational  velocity  of  the  bubble  near  its  minimum 
volume  accounted  for  the  differences  between  the  periods  of  the 
spherical  and  spheroidal  models  after  the  first  minimum  in  Figs,  2-4. 
because  the  spheroidal  bubble  was  deeper  than  the  spherical  one,  the 
hydrostatic  pressure  was  greater,  making  the  period  shorter.  This  is 
also  the  reason  that  the  curves  In  Figs.  2-11  associated  with  the 
spherical  bubble  model  terminated  .1  seconds  before  those  of  the  sphe¬ 
roidal  model.  Because  of  the  spherical  bubble’s  greater  upward  veloc¬ 
ities,  it  reached  the  surface  before  the  spheroidal  one. 

Figure  11  shows  the  energy  possessed  by  the  bubble  as  a  function 
of  time  for  both  the  spherical  and  spheroidal  models.  Since  the  InltLal 
solutions  to  the  equations  of  motion  for  the  spheroidal  bubble  were 
calculated  by  assuming  it  to  tiave  been  initially  spherical,  it  is  not 
surprising  that,  in  the  absence  of  dissipation,  the  energies  predicted 
by  the  two  models  were  found  to  be  constant  and  equal.  The  energy 
losses  predicted  by  the  spheroidal  model  without  radiation  reaction  were 
found  to  be  in  close  agreement  with  those  of  the  spherical  model  without 
radiation  reaction.  After  the  first  minimum  volume  was  passed,  the 
spheroidal  model  predicted  a  slightly  greater  energy  loss  than  the 
spherical,  until  the  spheroidal  bubble  passed  through  a  second  minimum 
volume . 
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The  most  dramatic  ditterence  between  the  spheroidal  and  spherical 
models  is  in  the  role  of  radiation  reaction.  As  can  be  seen  in  Fig-  11, 
the  iaclusion  of  radiation  reaction  in  the  spherical  bubble  model 
decreased  the  amount  of  energy  radiated-  For  the  spheroidal  bubble, 
this  was  the  case  only  until  the  first  minimum  volume  was  passed.  After 
that  time,  the  effect  of  including  radiation  reaction  in  the  calculation 
was  to  increase  the  radiative  energy  loss,  compared  both  to  that  of  the 
spheroidal  model  without  radiation  reaction  and  to  that  of  either 
spherical  model-  Moreover,  since  even  without  the  iaclusion  of  radia¬ 
tion  reaction,  the  spheroidal  model  yielded  a  greater  energy  loss  than 
the  analogous  spherical  case,  it  seeniis  that  this  was  not  an  artefact  of 
the  approximations  used  in  the  computation  of  the  radiation  reaction 
terms-  Whether  the  magnitude  of  the  increase  in  the  energy  loss  which 
occurred  when  radiation  reaction  terms  were  added  to  the  equations  would 
be  as  large  as  that  indicated  by  Fig.  11  is  rather  more  uncertain.  The 
close  correspondence  between  the  predicted  energy  losses  for  the 
spherical  and  spheroidal  models  prior  to  the  first  mlalmum  suggests  that 
the  calculation  was  valid,  at  least  in  the  regime  in  which  the  bubble 
was  nearly  spherical.  However,  as  mentioned  above,  after  the  first 
minimum,  the  actual  shape  of  the  bubble  is  not  really  spheroidal. 
Consequently,  the  actual  radiation  loss  may  be  quite  different  to  that 
calculated  for  a  spheroidal  bubble.  However,  since  the  bubble  is  even 
less  spherical  than  it  is  spheroidal,  on  balance,  it  seems  probable  that 
the  predictions  of  the  spheroidal  model  were  more  accurate  than  those  of 
the  spherical - 

Hicks  (1972)  solved  the  equations  of  motion  for  a  spherical 
bubble,  with  the  addition  of  a  hydrodynamic  drag  term,  for  227,27  kg.  of 
TNT  at  a  depth  of  45.73  metres  below  the  surface,  (That  is,  Hicks  took 
eqs.  [2*11]  as  his  equations  of  motion,  with  as  defined  by  eq-  [2-8] 
and  “  0)*  It  been  observed  that  a  bubble  from  an  explosion  with 
these  characteristics  rises  approximately  3-35  metres  from  the  location 
of  the  explosion  in  Che  time  taken  to  reach  its  first  minimum,  Hicks 
found  that  where  drag  is  the  only  source  of  dissipation,  a  drag  coef¬ 
ficient  of  Cq  =  2.25  had  to  be  introduced  into  the  equations  of  motion 
for  a  spherical  bubble  in  order  to  reproduce  this  behaviour-  In  this 
work,  for  the  initial  solutions  chosen,  it  was  found  that  a  drag  coef¬ 
ficient  of  Cj^  “  1-85  brought  the  predicted  rise  of  a  spherical  bubble 
into  better  agreement  with  observation-  When  radiative  dissipation  was 
also  included,  a  drag  coefficient  of  =  1,6  yielded  better  agreeement 
with  observation. 

In  Figs.  14-23,  the  curves  labelled  *  spherical*  were  obtained  by 
taking  eqs.  [2-10]  -  [2.11  ]  as  the  equations  of  motion  for  a  spherical 
bubble  produced  by  227.27  kg-  of  TNT  detonated  45-73  metres  below  the 
surface-  All  of  the  symbols  in  the  legend  for  these  figures  have  the 
same  meaning  as  in  Figs-  2-11-  It  should  also  be  understood  that,  for 
tVie  spherical  bubble,  the  effects  of  drag  were  Ignored  unless  a  value  of 
the  drag  coefficient  is  given  in  Che  legend.  When  drag  was  con¬ 
sidered,  Fj^  was  given  by  eq,  j2.S].  It  should  be  emphasized  that  a  drag 
term  was  incorporated  only  into  the  equations  of  motion  for  a  spherical 
bubble  and  never  into  eqs,  [3.43]  -  [3,A6],  the  equations  of  motion  for 
a  spheroidal  bubble. 
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The  behaviour  of  the  spheroidal  bubble  in  Figs-  14—17  was  very 
similar  to  chat  in  Figs-  2—5.  One  difference  was  the  close  agreement 
among  ail  four  curves  for  the  period  of  the  first  bubble  oscillation- 
As  well,  for  the  second  oscillation,  Che  period  predicted  by  the  sphe¬ 
roidal  model  was  closer  to  that  of  the  spherical  model  with  drag  than 
that  which  was  predicted  by  the  spherical  model  without  drag-  This 
applied  in  the  cases  for  which  radiative  energy  loss  was  considered  as 
well  as  those  cases  for  which  it  was  not.  In  Figs-  18-21,  it  can  be 
seen  that  the  peak  velocities  predicted  by  the  spheroidal  models,  both 
with  and  without  radiative  energy  loss,  were  in  agreement  with  those 
predicted  by  the  spherical  model  with  drag,  most  notably  at  the  first 
minimum. 

It  is  Fig.  22,  however,  which  demonstrates  the  accuracy  of  the 
spheroidal  model-  As  can  be  seen,  the  height  above  the  site  of  the 
original  explosion  predicted  by  the  spheroidal  model  was  in  good  agree¬ 
ment  with  that  predicted  by  the  spherical  model  with  drag,  while  that 
predicted  by  the  spherical  model  in  the  absence  of  drag  disagreed  with 
that  predicted  by  the  spherical  model  with  drag*  Because  the  drag 
coefficient  used  with  the  spherical  model  was  chosen  specifically  to 
force  the  calculated  height  to  agree  with  observation,  the  agreement  of 
the  spheroidal  model  with  the  spherical  model  In  this  case  consitutes  a 
verification  of  the  spheroidal  model-  The  agreement  was  not  as  good  at 
the  second  minimum,  especially  for  the  spheroidal  bubble  without  energy 
loss-  However,  once  radiative  energy  loss  was  added  to  the  spheroidal 
model,  the  curves  exhibited  close  agreement  with  those  for  the  spherical 
model  with  drag. 

Figure  23  compares  the  predicted  energy  losses  of  the  spherical 
and  spheroidal  models-  Orice  again,  it  can  be  seen  that  the  Inclusion  of 
radiation  reaction  terras  increased  the  predicted  energy  loss  for  a 
spheroidal  bubble,  in  contrast  to  the  spherical  model,  for  which  the 
addition  of  reaction  terras  decreased  the  energy  loss.  It  Is  inter-* 
esting,  however,  the  energy  remaining  to  the  spheroidal  bubble  was 
greater  than  that  remaining  to  the  spherical  bubble  when  losses  from 
both  drag  and  radiation  were  included. 

VI.  CONCLUDING  REMARKS 


In  this  work,  a  Lagranglan  for  an  oscillating  ellipsoidal  bubble 
which  is  also  undergoing  translational  motion  has  been  derived,  and 
equations  of  motion  obtained  from  it.  An  expression  for  the  generalised 
dissipative  forces  caused  by  the  radiation  of  sound  by  the  bubble  was 
also  found,  and  incorporated  into  the  equations  of  motion-  This 
equation  was  specialised  to  the  case  of  a  spheroidal  bubble,  and  the 
equations  of  motion  solved  for  some  different  charge  masses  and  depths, 
both  with  and  without  the  effects  of  radiation  of  sound  having  been 
Included . 
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By  comparison  with  the  results  obtained  from  Taylor’s  spherical 
bubble  model,  It  was  shown  that  the  spheroidal  model  retained  the 
successful  features  of  the  spherical  model,  notably  the  prediction  of 
tlie  bubble’s  oscillatory  period,  and  reproduced,  at  least  qualitatively, 
notable  features  of  a  real  bubble’s  behaviour,  such  its  flattening  near 
its  first  minimum,  and  a  slower  rise  time  than  that  predicted  by  the 
spherical  model.  After  the  first  minimum,  the  bubble’s  shape  was  not  as 
well  modelled  by  a  spheroid;  however,  the  results  obtained  at  those 
times  were  still  superior  to  those  from  the  spherical  model,  particu¬ 
larly  with  respect  to  the  rise  time.  For  the  case  cited  by  Hicks 
(1972),  the  spheroidal  bubble  model  produced  results  which  were  in  very 
close  agreement  with  experimental  data.  The  spherical  model  was  capable 
of  similar  agreement  only  for  a  limited  time  and  only  with  the  addition 
of  a  drag  terra.  The  spheroidal  model’s  advantage  arises  from  its 
reproduction  of  the  height  above  the  explosion  site  naturally,  without 
the  addition  of  a  drag  term  which  must  be  determined  from  experimental 
data  for  each  case.  Incidentally,  because  the  spheroidal  model  ignored 
hydrodynamic  drag  completely,  and  still  reproduced  the  observed  be¬ 
haviour  of  the  bubble,  it  seems  likely  that  drag  is  relatively  unimpor¬ 
tant  in  determining  the  bubble’s  motion;  it  appears,  rather,  that  the 
shape  of  the  bubble  is  the  single  most  important  factor. 

It  appears  that  the  spheroidal  model  predicted  a  greater  loss  of 
energy  in  the  bubble  through  the  radiation  of  sound  than  the  spherical 
model*  This  was  found  to  be  most  significant  when  radiation  reaction 
terms  were  included  in  the  radiative  dissipation  function.  In  contrast 
to  the  spherical  model,  the  effect  of  including  radiation  reaction  terms 
was  to  Increase  the  energy  loss. 
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Evolution  in  time  of  a  bubble  in  oil 
(photograph  from  Taylor  (1943)) 


390 


FIGURE  13 


6 

(31  0) 


5 

(16-5) 


(198) 


3 


2 

(105) 


1 

(SO) 


(a)  (b)  (<:) 

Fip.  8.  I'l)i>|(u:r.'|*lr<  Iml.lili'  in  nil  liiUcti  wllli  a  n  vclviiip  ilnim  anm.-ni.  i  imiiilx  r 

i(lfntilii'8ti'>l»ii"">'"''  nr|,lc.i((,piiipli.  J^owrr  nimilior  liii  |)tm'iil  hcsis) ;  np.‘  nl  l.nlil.lc  (in  nisi  o.) 


Evolution  in  time  of  a  bubble  in  oil 
(photograph  from  Taylor  (1943)) 
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STEFAN'S  PROBLEM  IN  A  FINITE  DOMAIN  WITH  CONSTANT  BOUNDARY 
AND  INITIAL  CONDITIONS 

Shunsuke  Takagl 

U.S,  Arny  Cold  Regions  Research  and  Engineering  Laboratory 

Hanover,  N.H.  03755 


ABSTRACT 

Stefan's  problem  In  a  finite  domain  Is  solved  under  constant  boundary 
and  Initial  conditions.  The  solution  is  initially  that  of  the  seml- 
Inflnlte  domain,  transits  through  Infinitely  many  Intermediate  stage  solu¬ 
tions,  and  finally  arrives  at  a  stationary  stage  solution.  An  intermediate 
stage  solution  emerges  when  a  corresponding  lead  time  becomes  effective. 
Because  exponential  singularities  exist  at  the  terminal  of  the  finite 
domain  as  functions  of  time,  lead  times  are  Introduced. 

The  breakthrough  is  achieved  by  two  innovations.  The  first  is  the  use 
of  the  Integral  type  solution  of  the  one-dimensional  heat  conduction  equa¬ 
tion  in  place  of  the  well-used  serial  type  solution.  By  use  of  a  finite 
domain  solution  determined  under  unrestricted  Initial  and  boundary  condi¬ 
tions,  the  integral  type  solution  allows  us  to  find  the  temperature  of  the 
old  phase  in  a  two-phase  Stefan  problem.  The  second  innovation  is  the 
inverse-Laplace-lntegral  type  expression  of  l*^erfc(xy 4ict)  that  is  valid 
for  any  Integer  k,  negative,  zero,  or  positive.  This  fornula  is  used  ^o 
expand  the  Interfaclal  temperature  of  the  old  phase  into  a  series  of  I'^t, 
and  to  sum  up  the  series  of  the  form, 

V  ^k  ^  2nJt  ±  X 
I  i  erfc  — - —  . 

n-0  /4ict 

This  summation  is  enqiloyed  to  evaluate  the  final  steady  temperature. 
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NUMERICAL  ABERRATIONS  IN  A  STEFAN  PROBLEM 
FROM  DETONATION  THEORY** 

G.S.S.  Ludford  &  A. A.  Oyedlran 
Department  of  Theoretical  &  Applied  Mechanics 
Cornell  University,  Ithaca  NY  14853  USA 


ABSTRACT.  The  velocity  of  the  moving  boundary  in  a  Stefan  problem 
from  detonation  theory  is  examined  analytically.  Motivation  comes  from 
computations  in  which  the  velocity  profile  exhibited  cusps  and  terminations 
(i.e.  inability  of  the  computer  to  find  a  velocity).  Here  we  show  that  a 
solution  exists  with  continuous  velocity  and  acceleration  at  all  times  but 
that,  under  certain  circumstances,  another  (singular)  solution  may  bifurcate 
off.  If  a  similar  phenomenon  occurs  in  the  finite-difference  schemes  used, 
the  analysis  suggests  that  the  aberrations  (cusps  and  terminations)  are  due 
to  numerical  inaccuracies.  The  next  step,  apparently  a  difficult  one,  is 
to  modify  the  schemes  so  as  to  follow  the  non-singular  solutions. 

I .  INTRODUCTION .  A  certain  model  of  detonation  waves  leads  to  the 
governing  equations 

F  -[K(t)+F]F  =  F_,  F(-®,t)  =  0,  F(-Ho,t)  =  F  ,  (1) 

F(TO,t)  =  F^,  F^(-0,t)-F^(+0,t)  »  1,  (2) 

where  F*»Fjo  are  given  positive  constants.  Here  x  is  measured  from  the 
moving  boundary  (the  flame  front),  whose  velocity  is  K(t),  and  the  solution 
must  satisfy  the  given  initial  conditions 

F(x,0)  =  F-(x)  with  F_(-«>)  =  0  and  F„(+«)  =  F  .  (3) 

U  U  U  00 

The  problem  (1-3)  is  overdetermined  when  the  velocity  of  the  boundary 
is  prescribed;  Indeed,  the  object  is  to  find  K(t)  along  with  the  solution 
F(x,t).  Numerically  this  is  achieved  by  advancing  the  solution  of  the 
truncated  system  (1,2a)  at  each  time  step,  with  K  as  a  parameter  whose  value 
is  determined  by  the  remaining  equation  (2b). 


There  is  a  steady  solution,  expressible  in  terms  of  exponentials, 
whenever 

2/F  <  F .  <  F  +  2/F  ,  (4) 

OO  OD  00 

but  it  is  (linearly)  unstable  if 

F  /2  +  1/F  +  /f  /4+1/F  <  F.  <  F  +  2/F  .  (5) 

oo  oooo  ooWoooo 

Early  computations  [1]  of  unstable  solutions  give  the  curves  in  figure  1: 
eventually  there  was  breakdown,  i.e.  the  computer  was  unable  to  find  a  value 
of  K,  although  in  some  cases  this  was  preceded  by  the  formation  of  cusps. 

’'“■'Supported  by  the  U.S.  Army  Research  Office. 
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Later  computations  [2],  based  on  more  accurate  numerical  schemes  always  led 
directly  to  breakdown. 


Figure  1.  Computational  aberrations  for  galloping  detonationi  cusps  and 
termination. 

The  object  of  the  present  paper  is  to  show  that  such  cusps  and  break¬ 
downs  are  numerical  aberrations,  i.e.  at  all  stages  of  the  evolution 
described  by  equations  (1-3)  the  velocity  K(t)  exists  and  the  acceleration 
K*(t)  can  be  continuous.  (An  earlier  paper  [3]  concluded  that  K’(t)  was 
discontinuous,  but  that  is  incorrect.)  For  that  purpose  we  shall  put  the 
origin  of  t  at  the  instant  of  Interest,  so  that  the  Initial  conditions  (3) 
are  no  longer  arbitrary  but  must  be  the  end  result  of  the  preceding  evolu¬ 
tion.  This  implies  (Sec.  2)  certain  relations  between  the  special  deriva¬ 
tives  on  the  two  sides  of  the  moving  boundary,  i.e.  on  the  derivatives  of 
Fq  at  X  =  To.  Such  initial  conditions  are  then  shown  (Sec.  3)  to  give  the 
velocity  the  required  properties 

K(+0)  =  K(-0),  K'(+0)  =  K'(-O).  (6) 

Although  there  is  always  a  solution  with  continuous  velocity  and 
acceleration,  in  certain  circumstances  there  is  a  second  (singular)  solution. 
Section  IV  discusses  the  implications  of  this  for  numerical  approximation 
of  the  evolution. 
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II.  EVOLUTIONARY  CONDITIONS.  While  initial  values  could  be  chosen  so 
as  to  violate  the  boundary  conditions  (2) ,  at  any  stage  of  the  subsequent 
evolution  these  conditions  must  be  satisfied.  This  implies 


Fq(TO)  =  F^,  F^(-0)-F^(+0)  =  1 


(7) 


when  Fq(x)  is  the  end  result  of  an  earlier  evolution.  Likewise,  there  are 
relations  Involving  higher  derivatives,  in  the  generation  of  which  the 
differential  equation  (1)  is  also  involved.  We  shall  need  to  consider 
derivatives  up  to  the  fifth  order  and,  in  doing  so,  the  following  consequences 
of  the  boundary  conditions  (2)  will  be  used: 


=  0;  f’-f'*'  =  1, 

X  X 


-  +  -  + 

F  ^  =  F  ,  F  =  F 
xt  xt  xtt  xtt 


(8) 


where,  for  example, 

F^(t)  =  F(+0,t). 


(9) 


The  zeroth-  and  first'-order  relations  (8a, d)  lead  to  the  requirement  (7)* 


The  second-order  relation  follows  from  the  differential  equation  (1) , 
which  gives 


-(K+F^)F+ 


(10) 


at  X  =  +0.  Thus 


F  /F 

XX  X 


=  f'^  /f"^  =  V 

XX  X 


(say) , 


and  the  corresponding  velocity  is 


(11) 


K  =  -(F^  +  V). 


(12) 


The  third-order  relation  comes  from  the  x-derivative  of  the  differential 
equation,  which  gives 


X  4-  +2  + 

F"^  -  (K+F^)F  -  F  =  F 

xt  *  XX  X  XXX 


at  X  =  ^0.  Thus, 

[F  ]  =  V^-tF^], 

XXX  X 

where  the  jump  notation 

[Q]  =  Q“  -  Q'^ 

(for  any  quantity  Q)  has  been  introduced. 


(13) 


(14) 


(15) 


To  obtain  the  fourth-order  relation,  the  second  x-  and  first 
t-derivative  of  the  differential  equation  must  also  be  considered;  these 
give,  for  x  =  +0, 
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(16) 


-  f;!: _ 


XXX  X  XX  xxxx 


-(k+f.)f''' -  k'f'*'  =  f'*’ 

*  Xt  X  xxt 


(17) 


Elimination  of  f"*"  .  and  f"*^  from  equations  (13),  (16)  and  (17)  shows 

XC  XX  L 

that 

(F“  -2VF“  +2VF"^+V^f") /f"  =  (f'*'  -2VF'''  +2VF‘''^+V^f'^) /F*^  =  -A(say)  (18) 

xxxx  XXX  X  X  X  xxxx  XXX  X  x'^  x  VJ/V/ 


and  the  corresponding  acceleration  is 

K'  A. 

An  alternative  expression  is 

-A  =  [F  ]  -  2V[F  ]  +  2V[F^]  +  V^. 

XXXX  XXX  X 


(19) 


(20) 


Finally  the  fifth-order  relation  is  obtained  by  also  considering  the 
third  x-derivative  and  the  second  mixed  derivative  of  the  differential 
equation;  these  give,  for  x  =  +0, 


f"*”  ^-(K+F.)f'*’  -AF'^’f'*’  -  =  f"*" 

XXX t  *  xxxx  X  XXX  XX  xxxxx 


f'‘’^j(k+f^)f‘'‘  ^-k’f'*’  -2f‘^f‘^^=  f'*’ 

xtt  *  xxt  XX  X  xt  xxxt 


(21) 

(22) 


Elimination  of  f"*"^,  f"*^  »  f"*"  ^ 

xt  xxt  xxxt 

leaves  only  the  time  derivatives 

subtraction.  The  result  is 


from  equations  (13),  (16),  (21)  and  (22) 
F^^^,  which  may  then  be  eliminated  by 


[F  ]  =  3V[F 
xxxxx  xxxx 


]-6[F  F  ]  +  7V^[F^]-2[F^]-2V^. 

X  XXX  X  X 


(23) 


The  requirement  (8a)  and  the  relations  (8d) ,  (11),  (14),  (ISa) ,  (23) 
between  special  derivatives  on  the  two  sides  of  the  moving  boundary  provide 
restrictions  on  F^,  if  the  initial  data  (3)  are  to  be  compatible  with  an 
earlier  evolution.  The  velocity  and  acceleration  attained  just  before  the 
initial  instant  are  given  by  the  formulas  (12)  and  (19)  which,  because  they 
contain  only  space  derivatives,  may  be  written  in  terms  of  F^,  Immediately 
after  the  initial  instant  the  velocity  and  acceleration  are  given  by  the 
same  formulas,  so  that  whether  or  not  they  have  discontinuities  depends  on 
whether  or  not  the  spacial  derivatives  do.  Such  infinitely  rapid  changes 
in  derivatives  at  the  boundary  are  effected  by  layers,  so  we  turn  now  to 
their  structure. 


III,  OUTER  AND  INNER  EXPANSIONS,  Let  t  =  0  be  the  instant  at  which 
a  discontinuity  is  supposed  to  occur.  We  shall  show  that  in  fact 


Kq=K(+0),  K^=K'(+0) 


(24) 


exist  and  can  be  equal  to 


K(.0)  -  -(F^+V),  K*(‘0)  =  A,  (25) 

where 

V  =  C_/G_  =  C^/G^,  (26) 

-A  =  (F_-2VT  +2VG^+V^G  )/G  =  (F . -2VT  +2VGf+vV  ) /G_^;  (27) 

we  may  also  write 

-A  =  (F_-F^)-2V(T_-T_|_)+2V(G_+G_|_)+V^.  (28) 

The  formulas  (25)  are  rewrites  of  the  results  (12)  and  (19),  while  the 
expressions  (26-28)  come  from  the  relations  (11)  and  (18)  and  the  expression 
(20)  on  setting 


»  F’ (+0)  =  G  ,  =  F"(+0)  =  C  , 

X  0  -  XX  0  -  XXX 


F  =  fJ,^(+0)  h  F  . 
xxxx  0 

+ 


FJJ'  (+0) 


+ 


(29) 


We  shall  also  need  the  rewrites 


Fq(+0)  =  F^,  G_  -  G_|_  =1,  (30) 

T_  -  -  (G_4G^)  ,  (31) 

F;  -  P*  =  3V(F  -F  )-6(G  T  -G  TJ+7V^(G  4GJ-2(G^-G^)-2V^  (32) 

I  "r  T"T"  “■+  “■'r 


of  the  relations  (7),  (14)  and  (23);  we  have  set 


F 


+ 

xxxxx 


+ 


The  strategy  is  to  seek  a  solution  with 
K  =  Kq  +  Kj^t+. .  -  for  t  >  0, 


in  which 

F  =  Fq(x)  +  tF^(x)  +  t^F2(x)+... 
away  from  the  origin.  The  differential  equation  requires 


(33) 


(34) 


(35) 
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(36) 


h  -  ^0’ 


2F2  -  (V^o)^F5  +  2(K^+F|j)(F'Vf"')  +  4F'F"  +  F"  +  K^F'. 


(37) 


The  relations  (26b),  (27b)  (30-32)  now  ensure  that 

F^(+0)  -  F2(+0)  -  0.  F^(-O)  =  F^(+0),  F^(-O)  -  F^(+0),  (38) 

i.e,  the  boundary  conditions  (2)  are  satisfied  to  the  order  implied,  provided 
the  choices  (25)  are  made  for  and  K^. 

The  task  of  showing  that  K,  K’  exist  and  can  be  continuous  at  all 
times  is  therefore  completed,  but  the  proof  (which  essentially  consists  of 
two  different  ways  of  deriving  the  evolution  relations)  does  not  explain  why 
discontinuities  appear  in  the  computations.  For  that  we  turn  to  a  discussion 
of  potential  boundary  layers,  which  uncovers  an  alternative  (singular) 
solution  in  certain  circumstances. 


The  boundary  layers  that  must  occur  at  the  origin  if  there  is  a 
discontinuity  are  described  by  the  similarity  variable 


5  =  x/t 


1/2 


» 


(39) 


and  we  write 

F  =  fQ(C)+t^^^f^(C)+tf2(C)+t^^%(5)+t^f4(0+t^^^f5(C)+..-  .  (40) 

Substitution  in  the  differential  equation  (1)  and  the  boundary  conditions 
(2) ,  plus  matching  with  the  expansion  (35) ,  then  gives  a  series  of 
differential  problems  for  the  coefficient  functions  f ^.f j^,f 2,f2»f ^5 • 

It  was  shown  in  [3]  that,  provided  the  initial  data  satisfy  the 
relations  (30),  the  first  two  functions  in  this  expansion  are 

fp  =  F*,  f^  =  G-S  for  C  <  0.  (41) 

The  next  term  In  the  expansion  satisfies 

^2  =  Vo  ^0  =  V^*’ 

SO  that 

2 

fo  =  G-K*  +  Ax^\5^+2)  +  A\^\//2)  for  5  $  0.  (43) 

Z  +  U  T  "r 

(2)  (2) 

Here  D  denotes  the  parabolic  cylinder  function  and  A-  ,  B-  are  integra¬ 
tion  constants,  the  latter  representing  the  strengths  of  the  boundary  layers 
that  effect  instantaneous  changes  in  the  curvatures  at  the  origin.  Matching 
with  the  outer  expansion  (35)  gives 

A^^^  =  C_/2,  (44) 

+  + 
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and  then  the  boundary  conditions  (2)  require 

G_(K*+V)  +  D  +  B^^^  =  0.  (45) 

If  G_+G_^  ^  0,  the  (unique)  solution  of  this  homogeneous  system  is 

K*  +  V  =  B^^^  =  0.  (46) 

which  confirms  that  Kq  exists  and  has  the  value  (25a).  Consideration  of  the 
problems  for  and  f^  then  shows  that  has  the  value  (25). 

The  cusps  and  terminations,  found  in  the  computations,  correspond  to 

G_  +  =  0,  i.e.  G_  =  ±1/2.  (47) 

The  solution  of  the  homogeneous  system  (45)  is  then 

B^^^  =  +(Kg  +  V)/2  D  (0),  (48) 

-h  ^  ^ 

where  (in  K*)  is  undetermined-  To  determine  it  we  must  go  to  the  next 
term  in  the  expansion  (40) ,  which  satisfies 

f^  +  ^  ?f^  -  |f^  =  -K*f^  -  (49) 

The  general  solution  is 


^3  “  ^^0^2  ^  '^V4(U|/’^2)  for  5  <  0,  (50) 


where  \  B^  ^  are  integration  constants,  the  latter  enabling  instantan- 

+  + 

eous  changes  in  the  third  derivatives  at  the  origin. 

Matching  with  the  outer  expansion  (35)  gives 

A^^>  =  T_/6,  (51) 

+  + 

and  then  the  boundary  conditions  (2)  require 

-K*(K*+V)/A+D_^(0)B^^^  =  K*^-T_+T^+D_;^^(0)(B^^^+b|^^)//2  =  0.  (52) 

-h 

(3) 

Elimination  of  B_  '  from  these  three  equations  gives 
+ 

K*^  +  3VKg  +  2(T_-T^)  =  0  (53) 

where,  according  to  the  relation  (31), 

T_-T^  =  V^.  (54) 
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(55) 


It  follows  that  either  still  has  the  value  (25a)  and 

-  0, 


or  K*  =  -2V  and  0. 

U  +  + 

We  conclude  that  the  solution  bifurcates  when  =  0;  for  one 

of  the  continuations,  no  boundary  layers  form  at  the  origin  and  K  is 
continuous;  for  the  other,  both  the  second  and  third  derivatives  are  changed 
instantaneously  at  the  moving  boundary  and  there  is  a  jump  in  velocity*  The 
computational  implications  of  this  bifurcation  will  be  discussed  in  Sec*  IV; 
here  we  shall  examine  the  first  continuation  further,  with  a  view  to 
confirming  that  the  acceleration  is  then  also  continuous* 

The  next  terms  in  the  inner  expansion  are  found  to  be 

f,  =  V(3-4T  )5^/8  +  V(1±V^-4T  )/4  ±  K^/4 
4  -  -  1 


+  F  (gVl25^+12)/24  +  ^  |^/l)  ,  (56) 


f  =  -Vf '+(3V^±8T_)C^/24+t6V^-2V±l-4(V^T3)T_  ±  2VK  ]c/8 
5  4  + 

2 

+  F^5  aV205V60)/120  +  e"^  (57) 

+  + 

after  tnatching  with  the  outer  expansion.  Applying  the  boundary  conditions 
(2)  to  f^  shows  that 

±(K  -A)/4  +  D  (0)B^^^  =  B^^^  +  =0,  (58) 

^  +  ~ 

which  should  be  compared  with  the  system  (45)  for  G_  =  ±1/2.  As  there  the 
solution  is  not  unique;  we  may  write  + 


8^^)=  +(K^-A)/4D_5(0), 


(59) 


where  is  uidetermined .  To  determine  it  we  apply  the  boundary  conditions 
to  fg,  and  find 

2V(K^-A)/3A+D_g(0)B^^^=  3V(K^-A)//2-D^^(0)[B^^VBj^^]  =  0  (60) 

when  equations  (27),  (28),  (32)  are  used.  Since  ^  -15/ir/8/2, 

this  homogeneous  system  has  the  unique  solution 


=  A,  B^^^  =  0, 

^  + 

showing  that  the  acceleration  is  indeed  continuous. 


(61) 


412 


REFERENCES 


[1]  Stewart,  D.S.  &  Ludford,  G.S.S.  Near  Chapman-Jouget  detonations. 
Transactions  of  the  First  Army  Conference  of  Applied  Mathematics 
and  Computing,  Washington  D,C.,  1983.  ARO  Report  84-1  (1984), 

pp.  801-811. 

[2]  Oyediran,  A, A*  &  Ludford,  G.S.S.  The  Stefan  problem  of  detonation 
theory.  Transactions  of  the  Second  Army  Conference  on  Applied 
Mathematics  and  Computing,  Washington  D.C.,  1984.  ARO  Report 
85-1  (1985),  pp.  273-283. 

[3]  Ludford,  G.S.S.  Saw-tooth  evolution  in  a  Stefan  problem.  To 
appear  in  Lectures  in  Applied  Mathematics:  Nonlinear  Systems  of 
PDE  in  Applied  Mathematics, 


413 


THE  ROLE  OF  MODELING  IN  AN  INDUSTRIAL  EMVIROWENT 


Vi  jay  K.  Stokes 

General  Electric  Company 
Corporate  Research  and  Development 

ABSTRACT 

In  many  situations  in  industry  --  especially  those  relating  to  the 
development  of  new  concepts  and  machines  --  the  overall  problem  is  not 
sufficiently  well  defined  to  warrant  a  "brute  force"  use  of  the  field 
equations  of  mechanics.  Such  an  approach  can,  in  fact,  be  counterproductive, 
both  from  the  point  of  view  of  the  understanding  achieved  and  the  cost.  It 
is  more  appropriate  to  first  develop  simple  models  to  elucidate  the  physics 
of  the  problem.  Once  the  basic  mechanisms  have  been  understood,  more  refined 
answers  can  be  obtained  by  using  the  methodology  of  engineering  science. 

Two  examples  will  be  used  to  illustrate  this  point:  A  simple  analytical 
model  will  be  used  to  explain  the  apparently  anomalous  motion  in  a  new  orbital 
washer  —  in  which  the  motion  of  clothes  is  exactly  opposite  to  what  might  be 
expected  in  such  a  machine.  In  the  second  example,  simple  beam  theory  will  be 
used  to  develop  an  energy  absorbing  concept  for  thermoplastic  automotive 
bumpers,  which  overcomes  a  major  shortcoming  in  existing  bumpers,  namely  their 
inability  to  absorb  significant  amounts  of  energy  for  impacts  over  the  supports. 
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CONDENSATION  ON  FRACTALS  SETS 

J .  S .  Geronimo 
School  of  Matheinatics 
Georgia  Institute  of  Technology 
Atlanta,  Georgia  30332  USA 


ABSTRACT ,  Araong  the  many  rich  directions  of  study  con¬ 
nected  with  fractal  sets,  there  is  the  theory  of  moments  of 
balanced  measures  on  these  sets.  The  balanced  measure 
attaches  equal  weight  to  each  of  the  mappings  v7hich  generate 
the  fractal  set.  This  implies  in  many  cases  that  the  pov/er 
moments  can  be  calculated  explicitly.  One  problem  with 
balanced  measures  is  that  they  are  either  absolutely  or 
singularly  continuous  and  consequently  do  not  give  rise  to 
operators  with  eigenvalues.  Here  we  report  on  a  new  class 
of  sets  and  measures  which  we  call  condensed  fractal  sets 
and  condensed  measures  v/hich  give  rise  to  a  point  spectrum. 
We  will  also  discuss  some  applications  to  specific  physical 
problems . 


I.  INTRODUCTION.  Let  R(z) :  C  ^  C  denote  a  rational 
transformation  of  the  extended  complex  plane  C  =  C  u  {“}  into 
itself  of  degree  N  >  1.  Then  R(z)  =  P(z)/Q(z)  where  P  and  Q 
are  nontrivial  polynomials  with  no  common  factors.  Set 

(z)  =  Ro  R^^  (z)  and  R^  (z)  =  z.  The  Julia  set  J  of 

R(z,)  is  equal  to  the  closure  of  the  set  of  all  repulsive  k 
cycles  of  R  for  all  finite  positive  integers  k. 

Associated  with  the  action  of  R  on  C  there  is  a  unique 
measure  y,  called  the  balanced  measure  for  R,  with  the  pro¬ 
perty  that 

1  ^  -1 

/  f  dy  =  g  /  T.  f(R^  (x))dy  (1) 

J  J  i=l 

V7here  f  e  L^(J,y)  and  R.^(z),  i  =  1,2, ...,N,  denotes  a  com- 

^  -1 

plete  assignment  of  the  branches  of  R  (z).  If  R(z)  is  a 
polynomial  then  the  orthonormal  polynomials  Pj^(x)  associated 

with  y  i.e.,  ?j^  (^)  “  k(n)x^  +  ...,  k(n)  >  0  with 

/  ppx)p-;p=71dp  -  (2) 

J 

satisfy  the  relation  [BGHl] ,  [BM] 
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When  J  is  real  these  polynomials  also  satisfy  the  following 
three  term  recurrence  formula 


a(n+l)p^^^(x)  +  b(n)p^(x)  +  a(n)p^_j^(x)  =  xp^(x)  (4) 

with  Pq  (x)  =  1  and  p_j^(x)  =  0,  The  above  is  just  the  discrete 

analog  of  the  Schrodinger  equation.  Let  A  be  the  infinite 
dimensional  Jacobi  matrix  associated  with  (4),  i.e. , 


then  a  consequence  of  (3)  is  that  A  satisfies  the  following 
renormalization  group  equation  [B] ,  [BGM] , 

D(z-A)^^D*  (Tz-A)“^  .  (5) 


Here  D:  ^  ^2  ijj(nN),  e 

II.  CONDENSED  JULIA  SETS.  Instead  of  diving  into  the 
general  phenomena  of  condensation  we  will  illustrate  it  with 
a  simple  example.  Let  us  consider  the  Julia  sets  J  and  J 

•  •  2  ^ 

associated  v;ith  the  transformations  Rz  ==  z  -X  and  R  z  = 

2  .  ^ 
z  -  A  +  e/z  respectively.  Here  A  >  2.  We  begin  by  recalling 

the  construction  of  J.  Let  I^^^  =  [-a, a]  with  a  =  ^  +Vl  +  ^  , 


this  being  the  largest  fixed  point  of  R. 

I  =  R  1)^  readily  shown  that 

and  that  J  =  lim  I .  j for  n  =  0,1, 


Defining 
J (n)  ^  J (n-1) 

and  2  are  shown 


na-oo 


below. 


-a 


-a  _ 

(1/2) 


a 

11/4) 


(1) 


I 


(1) 


V  A-  a  _ 

(1/2) 


V  A-  a 

{lyvr 


V  A-  a 

(1/4) 


a 

U7W~ 
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( n )  oo 

A  sequence  of  measures  {p  /  _n  which  converge  weakly 

(0) 

to  y  can  be  constructed.  One  begins  by  letting  y  have 

total  mass  1  on  Now  following  (1)  y^^^  will  have  1/2 

its  mass  on  each  of  the  tv;o  intervals  of  y  will 

(2 ) 

have  1/4  its  mass  on  each  of  the  four  intervals  of  I 
etc .  etc . 

Let  us  construct  J^.  when  0  <  e  <<  1.  Let  =  [-a-6,  a+6]  , 

where  a+6  is  the  largest  positive  fixed  point  of  R^(z), 
so  6  >  0  as  e  •>  0.  Let  I  -then  again  = 

lim  I .  I for  n  =  0,1,2  are  shown  below 

n->oo 

l(l) 

^  (1)  ^  a 

jd) 

__  ___ 


1/9  1/9  1/9  1/9  V9  1/9  1/9  ‘  1/9  1/9 

Since  has  three  inverse  branches  we  see  that  the  original 

interval  will  be  split  into  three  disjoint  intervals 

for  e  small  enough,  and  each  of  these  intervals  will  in 
turn  split  into  three  other  disjoint  intervals. 

jT  f  1 

Again  one  can  construct  a  sequence  of  measures  {y  j 
which  converge  weakly  to  y^.  Here  y^^^  has  total  mass  1  one 
while  y^”^  places  a  total  mass  of  1/3^  one  each  of  the 
3^^  intervals  of  I  .  In  the  limit  as  e  0  v;e  see  that 

J  =  J  u  {lim  r“"(0) }.  (6) 

na-oo 

That  is  Jq  is  the  Julia  set  associated  with  the  transformation 

R(z)  =  z^-X  plus  zero  and  all  its  preimages.  Furthermore  the 
measure  associated  with  Jq  becomes 
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(7) 


,n 


dpQ  =  Z 


n=0  ra=l  3 


^  i(x-K^''>)dx 


(n)  •.2 


n 


where  {x  M  denotes  the  preimages  of  order  n  under 
m  m=l  ^  ^ 

R{z)  of  zero,  and  6  (x)  is  the  Dirac  delta  function.  t^e  note 


the  analog  of  (1)  for  iJ„  is 

^  2 

1 


/  f  i  ;  I  £(R.^x)dp  +  i  f(0) 

•^0  ^0  ^=1 


(8) 


f  e  L-"(J,yQ) 


Generalizations  of  the  above  discussion  to  more  general 
rational  functions  can  be  found  in  [BGH2]  and  to  more  general 
mappings  of  compact  spaces  into  themselves  in  [BD] . 


III.  APPLICATIONS . 

a)  Schrodinger  equation  on  a  Sierpinski  lattice  [DABK] ,  [R] . 

We  consider  the  sequence  of  fractal  lattices  shown  below: 


The  boundary  conditions  used  identify  the  corners  of  the  two 
triangles  on  the  largest  scale.  Stopping  at  any  finite  n, 
v^e  consider  the  tight-binding  eigenvalue  equation  in  ^2 

=  e|4j>  (9) 

where  H  is  given  by 

H  =  -  XI  ^  I  ixjHi  j><i|  } 

nearest 

neighbors 

It  can  be  shown  [BGH2]  that  the  spectrum  of  H  is  C  u  {-4,2}, 
where  C  is  the  condensed  Julia  set  for  z  — z  (z+3)  + 

£(z^-l)~^.  Let  and  be  the  condensed  measures  for 

z  ~z  (z+3)  (z+l)/(z+l)  and  z  -»■  ~z  (z+3)  (z-1)  /  (z-1)  respectively; 
then  the  density  of  states  p  associated  with  H  can  be  written 
as 
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p 


+ 


1 

3 


X 


-2  ' 


Here  x_2  characteristic  function  of  the  set  {-2}. 

b)  An  application  to  orthogonal  polynomials.  Let  {p  } 

n 

be  the  polynomials  given  by  (2)  and  assume  that  J  is  real. 
We  now  construct  the  polynomials  of  the  second  kind 


(X) 


/ 

J 


y 


dy  (y)  . 


These  polynomials  obey  the  relation  [BGH3] 


(x)  = 


^n-l ^ 


Furthermore 
nomials  are 
[BGM] 


/  f  dy 
C 


the  measure  with  respect  to  which 
orthogonal  is  a  condensed  measure 


N  Hf (R. ^ (x) )dy 
=  J  /  _ i _ 

i=l  C  R'  {R~^ix))^ 


these  poly- 
and  one  has 


N 


+  I  r  f  (x  )  , 


n-1 


n 


n 


f  e  (C,y  ) . 


Here  C  is  the  condensed  Julia  set  associated  v;ith  y^^^, 

^Fe  the  zeros  of  R' (x) ,  and  {1  ,  are  predetermined 

n  n-i  n  n=l  '• 

constants  [BGM] . 
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NEWTON'S  METHOD,  JULIA  SETS  AND  CHAOTIC  DYNAMICS 


Edward  R.  Vrscay 
School  of  Mathematics 
Georgia  Institute  of  Technology 
Atlanta,  GA  30332 


Abstract 


The  Schroder  or  Konig  iteration  schemes  for  a  given  polynomial  g(z) 
ifeprEsent  generalizations  of  Newton's  method.  In-  both  schemes,  functions 
S^(z)  are  constructed  so  that  the  iteration  sequence  z^^^  = 

verges  locally  to  a  root  z*  of  g(z)  =0  with  prescribed  order  m.  Patho¬ 
logical  situations  involving  the  iteration  sequence  {z^}  do  exist,  however. 

If  z.-.  e  J,  the  Julia  set  of  rational  function  S  (z) ,  then  the  z  behave 

0  m  u 

chaotically  and  never  converge  to  a  root-  For  ^  J,  it  is  also  possible, 

however,  that  the  z  converge  to  attractive  cycles  other  than  the  roots  z*- 

Associated  with  this  latter  behavior  are  regions  in  a  "parameter  space" 
which  exhibit  the  morphology  and  dynamical  patterns  which  arc  associated 
with  the  classical  Mandelbrot  sets  of  quadratic  maps.  These  types  of 
behavior  are  investigated  with  the  aid  of  microcomputer  plots. 

1.  Introduction 

Consider  the  familiar  and  very  important  Newton  iteration  function, 
constructed  to  determine  the  roots  of  a  polynomial  g(z): 

N(z)  =  z  -  .  (1-1) 

g  (z) 

Clearly,  N(z)  is  a  rational  function.  One  is  interested  in  the  conditions 
for  which  the  iteration  sequence  converges  to  a  root  of 

g(z)*  We  may  ask  several  questions:  What  is  the  set  W'(z*)  of  all  initial 
values  Zq  €  C  for  which  the  sequence  {z^}  converges  to  the  root  z*?  Is  it 

possible  that  the  z  do  not  converge  to  any  of  the  z*?  For  what  e  (E  do 

these  pathological  situations  occur  and  what  is  the  nature  of  these  noncon- 
vergent  sequences  {z  }?  The  classical  theory  of  iteration  of  analytic  func¬ 
tions,  concerned  with  the  behavior  of  sequences  in  the  neighborhood  of  a 
fixed  point,  does  not  attack  such  global  problems.  Here,  the  Julla-Fatou 
theory  of  iteration  of  rational  functions  [10,  15]  (and  its  subsequent 
developments)  provides  an  insight  into  the  dynamics  of  such  iteration 
schemes.  Nonconvergent  sequences  associated  with  Newton’s  method  and  its 
generalizations  may,  for  example,  exhibit  asymptotic  periodic  or  even 
chaotic  behavior,  neither  of  which  may  be  simply  explained  away  as  results 
of  calculations  in  finite  precision  arithmetic.  As  will  be  shown  below, 
when  the  g(z)  constitute  a  one-parameter  family  of  polynomials  and  the 
parameter  is  varied,  the  asymptotic  behavior  of  nonconvergent  sequences  may 
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exhibit  a  cascade  of  period-doubling  bifurcations  eventually  transforming 
to  chaos,  a  characteristic  feature  of  quadratic  [7,  11,  16]  and  polynomial¬ 
like  [8]  maps. 

The  behavior  of  Newton’s  method  as  a  deterministic  dynamical  system 
on  the  real  line  has  been  discussed  in  a  recent  article  by  Saari  and  Urenko 
[19],  Howland  and  Valllancourt  [13]  investigated  the  pathological  attrac¬ 
tive  cycles  which  may  be  encountered  in  Newton’s  method.  Curry  et  al.  [6], 
by  means  of  a  series  of  experiments  and  the  main  concepts  of  Julia-Fatou 
theory,  catalogued  the  behavior  patterns  associated  with  Newton’s  method 
as  applied  to  a  one-parameter  family  of  cubic  polynomials.  This  present 
study  was  motivated  in  part  by  Ref.  [6].  We  consider  a  generalized  family 
of  Schroder  iteration  functions  having  the  form 

m-1 

S^(z)  =  z  +  Z  m  =  2,3,4,...,  (1.2) 

n=l 

and  constructed  so  that  the  iteration  sequence  z  ..  =  S  (z  )  converges 

n+i  m  n 

locally  to  a  zero  ‘z*  of  g(z)  as  0(|z-z*|  ).  The .case  m  =  2  corresponds 

to  the  Newton  method  of  Eq.  (1.1),  When  g(z)  is  a  polynomial,  the  degree 

of  the  rational  function  S  (z)  increases  with  m.  For  m  >  2,  the  S  (z) 

m  .  m 

functions  may  possess  extra  fixed  points  which  are  generally  distinct  from 

the  root^  z*.  If  attractive,  these  points  may  trap  the  Schroder  iteration 

sequence  {z  }. 

n 

The  Schroder  functions  are  discussed  in  Section  2.  In  Section  3 
Newton’s  method  and  its  generalizations  are  examined  in  the  light  of  Julia- 
Fatou  theory  of  iteration  of  rational  functions.  In  Section  4  are  presented 
microcomputer  generated  plots  of  the  basins  of  attraction  of  the  Schroder 

4 

schemes  for  m  =  2  and  3  as  applied  to  the  function  g(z)  =  z  -  1.  In  each 

case  the  common  boundary  of  these  basins  of  attraction  constitutes  the  Julia 

set  of  the  Schroder  iteration  function.  We  also  examine  the  dynamics  of 

Schroder  maps  for  a  one-parameter  family  of  cubic  polynomials.  There  exist 

regions  in  complex  parameter  space  where  critical  points  of  the  S  (z)  are 

m 

attracted  to  points  or  cycles  which  do  not  correspond  to  roots  of  the  g  (z) . 

A 

These  regions  exhibit  the  morphology  and  classical  characteristics  of 
Mandelbrot  sets.  In  Section  5,  another  family  of  iteration  functions  of 
prescribed  order,  the  Konlg  functions.  Is  introduced  and  briefly  examined. 

2.  The  Schroder  Iteration  Functions 


Let  f(z):  I  -►  C  be  analytic  on  a  compact  subset  T  of  the  complex  plane 
1C,  having  fixed  point  p  e  T,  i.e.,  f(p)  =  p.  The  fixed  point  p  is  aXtAjaitti\}Q., 
■cndC^^eAent  or  Aeputi-ivc  depending  on  whether  |f'(p)|  is  less  than,  equal 
to  or  greater  than  one.  If  f'(p)  “  0,  then  p  is  termed  ■&upeACLttfLacX^ve.. 

Given  a  starting  value  Zq  e  T,  we  define  the  Iteration  sequence  by 

z  ,.  =  f(z  ),  n  =  0,1,2 .  Now  assume  that  p  is  attractive,  i.e.,  that 

n+j-  n 

z  p  as  n  '»■  <».  Let  e  =  z  -  p  be  the  error  associated  with  the  nth 
n  n  n 

iterate.  Using  the  Taylor  expansion  of  f(z),  we  have 
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%+l  "  ^n+1  “  P 

*  f(ej^  +  P)  -  f(p) 


(2.1) 


m! 


+  0[(e  ) 
n 


m 


n 


where  m  is  the  smallest  Integer  for  which  ^  0  (usually  m  =  1) . 

Then  f(z)  is  said  to  be  an  ■itz/LCUtCon  function  0(J  OAdZA  m. 


We  now  consider  the  construction  of  iteration  functions  of  prescribed 
order  to  determine  the  simple  roots  of  g(z)  =  0.  The  first  case,  m  =  2, 
corresponds  to  quadratic  convergence  and  can  yield  the  familiar  Newton 
method.  The  point  z*  is  a  zero  of  g(z)  iff  it  is  a  fixed  point  of 
f(z)  =  z  *  h(z)g(z),  where  h(z)  is  an  arbitrary  non-zero  function  analytic 
in  T.  The  problem  is  to  construct  f(z)  so  that  f ' (z*)  =  0.  Since  f'(z)  = 

1  -  h'(z)g(z)  -  h(z)g'(z)  and  g(z*)  =  0,  we  may  choose  h(z)  =  [g'(z)]~^ 
to  give  Eq.  (1.1)  for  g'(z)  #  0.  This  represents  a  generalized  and  non¬ 
geometric  procedure  of  constructing  Newton's  method  over  the  complex  plane. 


Higher  order  iteration  functions  may  be  constructed  in  the  same  spirit. 
The  SdfVCddQA  ItZfmtioYi  (,unction6  [20]  have  been  defined  in  Eq.  (1.2),  with 


c_(z)  = 


r  1  d  ,  n  -  1  1 

dz^  g'(z) 


(2.2) 


n n!  'g' (z) 

The  coef ficiehts  c^(z)  are  analytic  functions  for  g' (z)  ^  0.  The  iteration 

sequences  defined  by  z  .  =  S  (z  )  converge  locally  to  the  zeros  z*  of 
jjj  n+i  m  n  i 

g(z)  as  0(|z-z*|  )  .  To  see  this,  we  assume  g(z)  to  be  analytic  in  T  and 
g'  (z)  0.  The  functions  S^(z)  are  analytic  in  T.  For  every  z*  e  T  such 

that  g(z*)  =  0,  it  follows  that  S  (z*)  =  z*  and 


S'(z*)  =  S"(z*)  =  ...  -  S^™”^\z*)  =  0. 
mm  m 

A  proof  of  Eq.  (2.3)  is  given  in  Henrici  [12],  p.  520. 


(2.3) 


The  S^(z) 
m 

series  in  g(z), 
S 


functions  in  Eq.  (1.2)  are  truncations  of  a  general  infinite 
the  first  three  terms  of  which  are  given  explicitly  below: 

2 


®  2[g'(z)]' 


[g(z)]' 


y  [g"(z)]^  -  ^  g'(z)g"'  (z) 
2 - 

[g'(z)]^ 


(2.4) 


The  construction  of  S^(z)  requires  a  knowledge  of  the  first  m— 1  derivatives 
of  g(z).  From  Eq.  (1.2)  we  see  that  only  for  m  =  2,  the  Newton  method,  does 
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the  fixed  point  condition  “  P  l®ply  that  g(p)  =  0.  For  m  >  2,  how¬ 

ever,  it  implies  that  either  (i)  g(p)  =  0  or  (ii)  T^(p)  “  0,  where 


T  (z)  “  Z  c  (z)[-g(z)]”  .  (2.5) 

m  ^  n+i 

n-0 

The  introduction  of  these  extra  fixed  points  may  complicate  the  root-finding 
procedure:  as  repulsive  or  indifferent  fixed  points,  they  alter  the  basins 
of  attraction  for  the  roots;  as  attractive  fixed  points,  they  may  trap  an 
iteration  sequence, 

3.  Julia-Fatou  Theory  and  Schroder  Rational  Iteration  Functions 

The  theory  of  Iteration  of  rational  functions,  originating  in  the 
classical  research  of. Julia  [15]  and  Fatou  [10],  has  witnessed  a  dramatic 
resurgence  of  interest  in  the  last  twenty  years.  A  comprehensive  account 
of  this  research  is  given  in  the  excellent  review  of  Blanchard  [3]-  Details 
of  many  important  proofs  are  given  in  the  paper  by  Brolin  [4].  Some 
important  concepts  and  their  connection  with  Newton-type  iteration  schemes 
are  outlined  below. 

Let  R(2)  be  a  rational  function,  R(z)  -  P(z)/Q(z)  where  PCz)  and 
Q(z)  are  polynomials  with  complex  coefficients  and  no  common  factors,  and 

d  '  deg(R)  E  max{deg(P) , deg(Q) }  ^  2.  The  sequence  of  iterates  {R^(z)}  of 
R(z)  is  defined  by 

R°(z)  =  z,  R^(z)  =  R(z),  R^'^^(z)  =  RCr'^Cz)),  n  =  0,1,2,...  . 

The  inverses  of  R(z)  shall  be  denoted  by  R^^(z),  where  the  subscript  index 

i  =_l,2^,.,,d  enumerates  all  branches  of  the  inverse.  We  now  consider 
R:  C  C  where  I  -  (E  u  {°^}  denotes  the_Riemann  sphere  with  suitably  defined 
spherical  metric.  Given  a  point  z^  e  1C,  the  iteration  sequence 

given  by 

defines  the  ^O^CUid  of  Zq, 

If  R^(p)  =  p  and  r“(p)  ^  p  for  m  <  k,  then  p  is  a  (Jxxed  poXnt  0^  ofidzn 
k.  The  set  of  distinct  points  “  1,2,3, ... ,k} ,  where 

p^  ~  ®-(p)  »  p2  “  F-CPj^)  •  •  •  •  »P]^  ~  ^^^k— 1^  * 

is  termed  a  k-cycle.  If  k  =  1,  p  is  simply  called  a  fixed  point  of  R(z). 

The  k-cycle  is  adtJjiJXdUuoz,  A.ndL^eJiznt,  Oh.  fWpilUiloz,  depending  whether  the 

multipler  l[R*^(p^)]'l  is  less  than,  equal  to  or  greater  than  one,  respectively. 

The  Julia  set  J(R)  of  the  rational  map  R:  C  -»•  5  is  formally  defined 

as  the  set  of  z  e  S  for  which  the  family  of  maps  r”(z)  is  not  normal,  in 
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the  sense  of  Montel  [1].  A  more  working  description  is  that  J(R)  is  the 
closure  of  all  repulsive  k-cycles  of  R(z) ,  k  “  1,2,3,...  .  Its  complement, 

F  »  C\J(R),  the  Fatou  set,  Is  the  set  of  all  z  e  C  for  which  the.  family 

r"(z)  Is  equicontinuous,  in  the  spherical  metric  on  some  neighbourhood 
of  each  point  of  F.  ^ 

Some  important  properties  of  J(R)  are  listed  below: 

(a)  J  0  and  J  is  closed; 

(b)  J  is  invariant  with  respect  to  R,  i.e.,  R(J)  =  J  =  R  (J) ; 

(c)  J(R)  =  J(R™),  m  =  2,3,4,...; 

(d)  If  J  has  interior  points  then  J  =  £; 

(e)  J(R)  is  compact  and  non-denumerable.  In  general,  its  Hausdorff- 
Besicovitch  dimension  is  non-integral,  whereupon  J(R)  is  a  fractal, 
as  defined  by  Mandelbrot  [17]. 

Let  p  be  an  attractive  fixed  point  of  R(z).  The  b(l6^n 

(stable  set)  W(p)  of  p  is  defined  as  the  set 

W(p)  -  {z  e  S|R^(z)  ^  p  as  n  00} . 

The  Jjntmdiatd  A(p)  of  p  is  the  maximal  domain  containing  p 

on  which  the  sequence  of  iterates  {R^}  is  normal.  We  now  have  the  following 
important  property:  boundary  of  W(p)  is  J(R).  It  follows  that  if  R(z) 
has  several  distinct  attractive  points,  then  their  basins  of  attraction 
share  the  same  boundary,  the  Julia  set  of  R(z). 

2 

A  simple  and  illustrative  example  is  afforded  by  the  map  R(z)  -  z  , 

The  unit  circle  C  =  {z:  |z|  =1}  is  Invariant  with  respect  to  R(z)  and  its 
iterates.  All  fixed  points  of  R(z)  and  its  iterates,  except  z  -  0  and  z  =  <» 
lie  on  C  and  are  repulsive.  C  is  the  Julia  set  of  R(z).  The  forward  orbit 
of  any  point  given  by  |z|  <  1  is  the  fixed  point  z  =  0.  The  forward  orbit 

of  any  point  given  by  Izl  >  1  is  the  point  at  infinity.  The  Julia  set  C 

may  be  regarded  as  a  under  the  action  of  the  forward  map  R(z). 

Equivalently,  C  may  be  regarded  as  the  attAacJtoK  6eX  for  the  inverses 

r"^(z)  =  +/?,  R2^(z)  =  -/i. 

Before  the  works  of  Julia  and  Fatou,  Cayley  [5]  began  an  investigation 
of  Newton’s  method  in  the  complex  plane.  Firstly,  he  analyzed  Newton’s 
method  of  determining  the  square  roots  of  unity,  i.e.,  the  zeroes  of 
2 

gCz)  -  z  -  1,  to  ascertain  the  basin  of  attraction  of  each  root.  Here 

N(z)  -  z/2  +  l/(2z).  One  would  expect  that  the  imaginary  axis  J  is  the 

boundary  of  the  two  attractive  basins,  W(+l)  and  W(-l),  since  it  is  easily 
shown  that  N(J)  =  J,  i.e,,  if  z  e  J  then  N(z)  e  J,  Now  consider  the  con^ 
formal  map  T(z)  -  (z^l)/(z+l).  Let  R  =  {z:  Re(z)  >  0}  and 

R  -  {z:  Re(2)  <  0).  Then  TJT  -  C  where  C  denotes  the  unit  circle, 

TR  “  {z:  |z|  <  1},  TR_T~^  =  {z:  |z|  >  1}  and  =  z^.  Furthermore, 

^  2 
T(+l)  =  0  and  T(-l)  -  N(z)  is  conformally  conjugate  to  the  map  R(z)  =  z 
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which  was  our  earlier  example*  It  now  follows  that  under  the  action  of 
N(z),  W(+l)  "  R^,  W(-l)  *  and  J  is  the  Julia  set  of  N(z).  As  such,  J 

is  the  repeller  set  for  N(z)  or  the  attractor  set  under  the  action  of  N^^(z), 

i  =  1,2*  In  the  language  of  Barnsley  and  Demko  [2],  J  is  the  attractor  for 
the  ^uncJtLon  {IPS)  {iC,w^(2),w  (z)},  where 

w^(z)  =  z  +  V  -  1, 

(3.1) 

w  (z)  =  z  -  ^  T* 


The  situation  is  not  so  simple,  as  Cayley  discovered  [9,  18],  for  the 

analysis  of  Newton’s  method  for  higher  roots  of  unity,  i*e*,  =  z^-1, 

n  >  2.  The  Julia  set  J  must  serve  as  a  common  boundary  for  all  W(z^).  In 

other  words,  given  any  point  z  e  J,  then  any  e -neighborhood  of  z  must 
include  points  from  all  the  W(zp .  To  illustrate.  Figure  1(a)  presents  the 

^  4 

basins  of  attraction  for  Newton's  method  for  g,  (z)  -  z  -  1,  The  Julia  set 

4 


boundary  is  a  complicated  curve  which  is  Infinitely  self -similar  —  a  magni¬ 
fication  of  any  region  reveals  further  similar  and  intricate  structure, 
characteristic  of  a  fractal  curve  [17]-  The  Julia  set  includes  points  on 
the  lines  Re(z)  -  ±Im(z)- 


Figure  1(b)  is  an  attractive  basin  map  for  the  Schroder  S  (z)  method 

8  4 

as  applied  to  g^(z)  =  0,  namely,  S^Cz)  =  (21z  + 14z^  -  3) / (32z  ).  The  Julia 

set  boundary  is  similar  in  structure  to  that  of  Figure  1(a)  on  the  diagonals, 
but  with  a  greater  number  of  "petals"  being  nested  in  a  self-similar  fashion. 
A  fundamental  difference  is  noted  in  the  immediate  stable  sets  A(z*)  of 

each  root  as  they  no  longer  extend  into  the  origin  as  in  Figure  1(a).  This 
is  due  to  the  presence  of  four  extra  fixed  points  of  S„(z),  as  given  by 

4 

the  zeroes  of  Eq*  (2-5)-  These  fixed  points  are  solutions  of  z  -3/11  =  0. 
They  are  repulsive  [21]  and  must  lie  on  the  Julia  set  J(S^)-  The  appearance 

of  these  extra  fixed  points  and  their  effects  on  the  basins  of  attraction 
demonstrates  the  caution  that  may  be  necessary  with  the  use  of  higher  order 
iteration  methods.  The  existence  of  repulsive  fixed  points  for  and 

iteration  methods  as  applied  to  g^C^)  0  has  been  discussed  for  general 

n  in  [21]. 


4*  Parcimeter  Space  and  Chaotic  Dynamics 

As  mentioned  earlier,  the  Newton  or  Schroder  methods  are  not  guaranteed 
to  converge  to  zero  of  g(z).  If  the  Initial  point  e  J,  the  Julia  set, 

then  the  sequence  {z^}  will  always  remain  on  J.  Apart  from  this  situation, 

however,  the  possibility  exists  that  the  z^  converge  to  periodic  cycles  or 
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Figure  1 .  Schrgder  basins  of  attraction 
in  Che  complex  region 
black  regions,  W(-i) ; 

(a)  S, 


t-l,lj  X  [-1,1] 
light  grey,  W(-l) ; 

(Newton)  method,  (b)  method 


W(z.)  for  the  roots  of  z  -1=0 
White  regions  constitute  W(i); 


dark  grey,  W(l): 

3 
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exhibit  chaotic  behavior.  Rather  than  trying  to  construct  specific 
examples  of  such  pathological  behavior,  we  may  systematically  examine  the 
iteration  schemes  associated  with  a  parametrized  family  of  polynomials 
g(z).  Here,  we  consider  the  one-parameter  family  of  cubic  polynomials 

g^(z)  “  +  (A-l)z  -  A.  (4.1) 

The  zeros  of  g^(z)  are  z^  =  1,  ^  -  *^1~4A)  / 2 ^  We  shall  now  be 

working  in  a  parameter  space  where  A  e  1.  Each  point  A  =  (Re(A),  Im(A)) 
represents  a  dynamical  system  with  its  own  fixed  points,  Julia  sets  and 
possible  attractive  cycles. 

Curry  et  al.  [6]  first  examined  Newton's  method  in  this  parameter  space 

to  discover  regions  in  A-space  where  attractive  periodic  cycles  exist. 

This  feature  is  also  observed  for  the  higher  order  S  (z)  functions  as  well 

m 

as  the  possible  existence  of  extra  attractive  fixed  points  corresponding  to 
the  roots  of  Eq.  (2.4)  [21].  Here  we  restrict  our  attention  to  the 

(Newton)  and  iteration  schemes,  the  former  of  which  will  serve  as  a 

reference. 

In  order  to  detect  the  existence  of  attractive  cycles  which  could 

interfere  with  the  Schroder  search  for  the  z*,  we  observe  the  orbits  of 

1 

the  CAaJxCjOlZ  pO'tnt^  of  the  S  (z),  i.e.,  those  points  c  e  C  for  which 

m 

S^(c)  =  0.  The  underlying  reason  for  studying  these  special  orbits  rests 

in  the  following  theorem  of  Fatou  [10]:  If  R(z)  is  a  rational  function 
having  an  attractive  periodic  cycle,  then  at  least  one  critical  point  will 
converge  to  it- 

Among  the  critical  points  of  the  S^(z)  are  the  zeroes  which,  of 
course,  are  also  attractive  fixed  points  (1-cycles)  of  the  S^(z) ,  These 

points  are  obviously  not  free  to  converge  to  any  other  attractive  cycle. 

Other  roots,  which  we  shall  call  the  poiniM  c^  are  avail¬ 

able,  however.  The  free  critical  points  for  the  first  two  Schroder  func¬ 
tions  associated  with  the  g^(z)  are  (i)  for  =  0  and  (ii)  for 

SjCz),  =  ±[(A-1)/15]^^^. 

In  order  to  study  the  dynamics  of  these  maps  on  a  microcomputer,  a 
region  of  the  complex  A-plane  was  represented  by  a  grid  of  400x200  points, 
each  point  corresponding  to  a  pixel  of  a  computer  video  terminal.  For  each 
point  A  =  (Re (A),  Im(A)),  a  free  critical  point  c^  was  computed  and  used 

as  an  initial  value  for  the  iteration  sequence,  z  , .  =  S  (z  )-  After  each 

n+1  m  n 

iteration,  the  distance  between  the  iterate  z,  and  each  z?  was  computed- 

^  ^  -4 

If  any  of  these  distances  was  less  than  a  prescribed  value  of  (10  )  the 

sequence  was  assumed  to  converge  to  that  particular  root  and  the  corresponding 


430 


pixel  was  colored  accordingly.  If  no  such  convergence  was  observed  after  a 
prescribed  number  of  iterations  (typically  200),  then  the  grid  point  was 
left  black.  The  resulting  black  areas  represented  regions  in  parameter 
space  for  which  additional  cycles  existed. 

S ^-Newton  Method 

Figure  2(a)  represents  the  region  of  the  complex  A-plane  [-2,2]  [-2,2], 

White  regions  represent  values  of  A  for  which  the  sole  critical  point 
Cj^  =  0  is  attracted  to  the  root  z*  =  1,  grey  regions  correspond  to  attraction 

to  either  of  the  roots  Small  black  areas,  representing  parameter 

values  for  which  pathological  attractive  cycles  exist,  are  observed  at 
A  ^  (0.31,  ±1.64)  and  (1.01,  ±0.98).  When  magnified,  these  regions  have 
the  same  general  shape  as  the  remarkable  Mandelbrot  bifurcation  sets  [17] 

2 

for  quadratic  maps  R(z)  -  z  -  X,  Four  other  sets  are  detected  on  the  real 
axis  at  A  =  0.26,  0,36,  0.5  and  0.65.  Figure  2(b),  a  magnification  of 
the  region  [0.35,  0.37] x  [0.01,  0.01],  reveals  a  characteristic  Mandelbrot^ 
like  set. 

The  existence  of  stAble  attractive  periodic  cycles  and  regions  of 
period-doubling  bifurcations  corresponding  to  the  region  in  Figure  2(b) 
is  shown  in  Figure  3.  Here,  we  plot  the  asymptotic  trajectory  (z  for 

n  >  10000)  of  the  critical  point  c^^  for  the  range  of  real  parameter  values 

0.35  <  A  <  0.37.  The  calculations  were  performed  on  a  CYBER  180/855  main¬ 
frame  computer  in  double  precision  (32  significant  digits) .  Let  us  examine 
the  dynamics  as  the  parameter  A  is  decreased  from  0.37.  The  free  critical 
point  is  eventually  mapped  to  z*  =  1.  Below  the  critical  value 

A  =  0,362683...,  is  suddenly  mapped  into  a  2-cycle.  As  A  is  further 

decreased  the  2-cycle  becomes  a  4-cycle,  etc.  .  The  bifurcation  to  8-  and 
16-cycles  proceeds  quite  rapidly,  with  an  eventual  transition  to  chaotic 
behavior.  A  three-cycle  is  then  observed,  followed  by  a  return  to  chaotic 
behavior,  etc.  ,  At  A  ^  0,35286  a  sudden  return  from  chaos  back  to  the 
fixed  point  z*  =  1  is  observed.  If  the  Mandelbrot  set  of  Figure  2(b)  is 
superimposed  on  Figure  3,  it  is  seen  that  its  pinch  points  correspond  to 

the  points  at  which  the  2^-cycles  bifurcate. 

S  1 1  e r a  1 1  on_  M_e  tho d 

Figure  4(a)  presents  regions  in  parameter  space  A  e  [-5,5]  x  [-5,5] 
for  which  the  critical  point  is  attracted  either  to  z*  =  1  (white), 

z*  2  (gi^ey)  or  neither  (black) .  The  parameter  space  map  for  the  other 

critical  point  c^  =  is  obtained  from  a  reflection  of  the  regions  in 

this  figure  about  the  real  A  axis.  An  enlargement  of  the  region 
[1-89,  1.95]  X  [-0.03,  0.03]  is  shown  in  Figure  4(b)  along  with  its  reflec¬ 
tion  about  the  real  A-axis.  The  upper  half  (including  real  axis)  of  this 
Mandelbrot-like  set  corresponds  to  A  values  for  which  does  not  converge 
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to  the  zt;  the  lower  half  corresponds  to  the  critical  point  C2-  Unlike 
in  the  Newton  method,  there  are  two  possibilities  for  the  orbits  of  the  c, 
to  be  trapped  away  from  the  roots  z^: 

(i)  convergence  to  a  k-cycle  as  seen  above, 

(ii)  convergence  to  additional  attractive  fixed  points  given  by  the  zeroes 
of  Eq,  (2-5),  where  g(z)  =  g^(z) • 

Figure  5  shows  the  asymptotic  trajectories  of  the  free  critical  point 
in  the  parameter  region  corresponding  to  Figure  A(b), 

5*  Konig  Iteration  Scheme 

Here  we  briefly  describe  another  set  of  iteration  functions  of  prescribed 
order.  Given  a  polynomial  g(z)  with  zeroes  z*,  the  Konig  iteration  functions 

are  defined  as  [14] 


K  (z)  -  z  +  (m-1) 
m 


[l/g(2)] 

[l/g(z)] 


(m-2) 
(m-1)  ’ 


(5.1) 


along  with  the  iteration  sequence  defined  by  z  .,  =  K  (z  )•  If  the  z  are 

n't!  m  n  n 

sufficiently  close  to  zero  z*  of  g(z),  then  z^  z*  as  0(|z^-z*|  )  [14]. 
The  case  m  =  2  again  corresponds  to  Newton’s  method. 


Figure  6  shows  the  basins  of  attraction  for  the  four  roots  of  unity, 

4 

i.e.,  g(z)  =  z  -  1,  for  the  iteration  procedure.  There  are  major  dif¬ 
ferences  between  this  basin  map  and  those  associated  with  the  (or  K^) 
and  iteration  methods  of  Figures  1(a)  and  1(b).  The  immediate  stable  set 

of  each  root  is  much  larger  in  Figure  6  since  the  ’’bubbly**  Julia  set  regions 
are  greatly  compressed  into  the  diagonals.  The  transition  from  the 
K^-Newton  method  to  the  method  is  significant.  Secondly,  the  itera¬ 
tion  function,  as  its  counterpart,  has  four  additional  repulsive  fixed 
points.  Unlike  the  function,  however,  these  points  lie  on  the  diagonal 

lines  Re(z)  “  ±lm(z)  and  not  on  the  real  or  imaginary  axes.  As  such,  they 

do  not  interfere  with  the  immediate  stable  sets  of  the  roots  z*. 

1 

We  now  consider  the  iteration  method  as  applied  to  the  one-parameter 
family  of  cubic  polynomials  g^(z)  given  in  Eq.  (4.1).  Figure  7(a)  repre¬ 
sents  the  region  of  complex  A-space  [^5,5]  x  [-5,5].  As  before,  regions 

1/2 

are  colored  according  to  whether  the  free  critical  point  c^^  =  [(A-l)/15] 
is  mapped  to  z*  -  1  (white),  3  “  neither  (black).  Only  one 

black  region  of  nonconvergence,  centered  at  A  =  (1.99,  -3*26)  is  detectable 
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4c 

Figure  6.  Basins  of  attraction  W(z.)  associated  with  the  Konlg  K 

A  ^  J 

iteration  method  as  applied  to  z  -  1  -  0.  Regions  are  shaded  as  in  Fig. 


Figure  7.  Parameter  space  maps  associated  with  the  Konig  method  as 
applied  to  the  cubic  polynomial  family  g*(z). 


in  this  plot,  A  magnification  of  this  region  is  presented  in  Figure  7(b) 
to  reveal  a  Mandelbrot^like  set.  Trajectories  of  for  sample  parameter 

values  lying  in  this  set  have  been  calculated.  Within  the  largest  cardioid- 
like  region  of  the  Mandelbrot  set,  c^  is  mapped  to  2-cycles,  As  the  para¬ 
meter  A  is  varied  along  the  axis  of  symmetry  a  period-doubling  cascade 
eventually  leading  to  chaotic  behavior  is  observed  as  in  Section  4  for  the 
Schroder  iteration  methods.  Interestingly,  in  no  regions  of  the 

Mandelbrot  set  are  the  c^  observed  to  map  to  fixed  points.  In  fact  it 
can  be  shhwn  [22]  that  there  exist  no  regions  in  the  parameter  space 
where  attractive  fixed  points  other  than  the  z*  exist. 
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ABSTRACT .  We  shall  first  discuss  algebraic  properties 
of  the  iterations  of  polynomials  and  show  that  these  prooer— 
ties  are  related  to  almost  periodic  Schrodinger  operators. 

An  exactly  solvable  model  is  introduced  which  displays 
interesting  features;  almost  periodicity,  singular  spectrum, 
chaotic  states,  exact  renormalization  group. 

I.  INTRODUCTION.  Singular  continuous  spectra  arise  in 
many  different  physical  models,  connected  with  either  fractal 
structures  or  almost  periodic  potentials  in  Schrodinger 
equations.  Physical  problems  in  which  scaling  properties 
play  an  important  role  lead  to  a  detailed  interpretation  of 
the  observed  phenomena  in  terms  of  fractal  structures  [1] , 
such  as  vibration  properties  of  proteins  [2] ,  percolation  in 
discontinuous  thin  films  [3]  and  diamagnetic  properties  of 
superconductors  near  the  percolation  threshold  [4].  Also, 
models  involving  a  fractal  structure  as  underlying  framework 
generate  singular  measures  in  a  natural  way,  as  for  instance 
the  vibration  spectrum  on  Sierpinsky's  gasket  [5]  or  Potts 
models  on  hierarchical  lattices  [6] . 

Another  important  class  of  physical  models  which 
generate  singular  continuous  spectra  is  that  of  the 
almost  periodic  Schrodinger  operators  [7] .  They  appear  for 
instance  in  incommensurate  structures,  conducting  or  super¬ 
conducting  linear  chains  [8] ,  in  the  electronic  properties 
of  crystals  in  a  magnetic  field  [9]  or  more  generally  in 
metal-insulator  transitions,  the  almost  Mathieu  equation  being 
an  example  [10] .  Very  little  is  known  about  the  nature  and 
behavior  of  the  wave  functions  for  states  belonging  to  a 
singular  continuous  spectrum. 

Many  discretized  equations  can  be  considered  as  Poincard 
maps  of  dynamical  systems  with  an  infinite  number  of  degrees 
of  freedom,  an  analogy  which  has  been  used  in  the  Frenkel- 
Kontorova  model  [11] .  In  this  article  we  shall  summarize 
the  relation  between  one  of  the  simplest  dynamical  systems, 
the  iteration  of  polynomial  mappings, and  Schrodinger  operators. 

In  Section  II,  we  introduce  an  invariant  measure  under 
a  polynomial  transformation.  In  Section  III  we  shov/  ortho¬ 
gonality  properties  of  iterated  polynomials,  and  associate 
to  those  a  Hilbert  space  operator  the  spectrum  of  which  is 
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the  Julia  set  of  the  polynomial  transformation.  In  Section 
IV  we  introduce  a  one  dimensional  discrete  Schrodinger  opera¬ 
tor  the  density  of  states  of  which  is  the  invariant  measure. 
In  Section  V  we  show  that  this  operator  has  almost  periodic 
properties  and  that  the  corresponding  eigenstates  display 
chaotic  behavior. 


II.  INVARIANT  MEASURES  UNDER  POLYNOMIAL  TRANSFORMATIONS. 
Let  us  consider "a  polynomial  T(x)  of  degree ”d  written 
in  canonical  form 


T(x)  =  +  ...  +  +  a^.  (II.  1) 

We  further  assume  that  there  exists  a  finite  interval  S  of 
the  real  line  such  that  for  any  x  e  S,  all  the  roots  of  the 
equation  T(y)  =  x  are  real  and  belong  to  S.  This  condition 
imposes  weak  constraints  on  the  real  coefficients  .  ,a.^. 

Consider  an  arbitrary  point  Xq  in  S.  Let  (Xq) 

i  =  1,2,. ..,d  be  the  d  first  preimages  of  Xq,  that  is,  the 

d  different  points  which  are  mapped  to  x_  by  T.  More  generally, 

(n)  ^ 

let  T '  be  the  nth  iterate  of  T 

(x)  =  T(x)  ,  2) 

(x)  =  [T{x)  ]  , 

and  let  ^  ~  l,2,...,d^  be  the  d^  roots  of 

the  equation  t'  ' (x)  =  Xq.  The  set  of  accumulation  points  of 

all  preimages  x^^j  of  Xq  is  the  Julia  set  of  the  polynomial 

T  [12,13,14].  Under  our  hypothesis  this  set  is  real,  con¬ 
tained  in  S,  and  independent  of  Xq. 


Following  Brolin  [14] ,  we  shall  consider  the  asymptotic 
distribution  of  the  predecessors;  we  define,  for  any  n  and  for 
an  arbitrary  (but  fixed) 


du  (x)  =  — 

n 


the  measure 


Six  -  x,„)) 


dx . 


(II. 3) 


This  is  a  discrete  measure  with  equal  weights  on  all  the 
preimages  of  order  n  of  Xq.  Brolin  [14]  asserts  that  the 

sequence  dy^  has  a  limit  in  the  weak  topology,  which  is 

independent  of  Xq,  when  n  goes  to  infinity.  This  limiting 

measure  dy(x)  has  been  recognized  to  have  special  invariance 
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properties;  it  is  invariant  under  T,  and  gives  equal  weight 
to  all  inverse  branches  of  T.  This  is  sometimes  referred  to 
as  the  balanced  property  [15] . 

For  our  purpose  here,  it  is  sufficient  to  define  the 
complexif ication  of  the  above  defined  measure,  which  is  sup¬ 
ported  by  the  Julia  set  contained  in  S.  We  define 


G(z) 

The  invariance  property  is 
equation: 


dy  (X) 

z-s 


reflected 


.  (II. 4) 

by  the  follov/ing  functional 


G(2)  =  ^  T*  (z)G(T(2)  )  .  (II. 5) 

Expanding  G(z)  around  z  =  <»,  we  get: 


G(z)  =  S  ,  (II. 6) 

n=0  z 

where  the  are  the  moments  of  the  measure  dy  (x) 

y^  =  /  x’^dy  (x)  .  (II.  7) 

S 

It  is  easy  to  see  that  (II. 5)  allows  one  to  compute  the 
moments  y^^  recursively,  provided  one  normalizes  yg  to  the 

value  +1. 

The  invariance  properties  of  the  measure  are  best  sum¬ 
marized  by  the  following  identity: 

/  <])  [T(x)  ,x]dy  (x)  =  i  /  I  Z  (|)  (x,tT^  (x)  )  1  dy  (x)  ,  (II.  8) 
S  S  li=l  ^  J 

where  (]>  is  an  arbitrary  measurable  function  of  two  variables. 

III.  ORTHOGONAL  POLYNOMIALS  AND  THE  HILBERT  SPACE 
OPERATOR  ASSOCIATED  TO  A  JULIA  SET.  It  iS  natural  to  intro¬ 
duce  the  set  of  orthogonal  polynomials  associated  to  the 
positive  measure  dy (x) .  We  consider  the  set  of  polynomials 
Pn^^^  of  degree  n  =  0,1,2, ...,<»  with  highest  degree  coef¬ 
ficient  equal  to  1,  which  satisfy 

/  Pjjj(x)  P^(x)dy  (x)  =  ^  .  (III.l) 

s  ^ 

Using  (II. 8)  in  an  appropriate  way,  one  finds  [15,16,17,18, 

19]  that 
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(III. 2) 


Pn(T(x))  =  P^^(x). 

Iterating  (III. 2),  one  gets 

P  (T^^^  (x) )  =  P  .  (x) .  (III. 3) 

^  nd^ 

For  n  =  1  (III. 3)  reduces  to 

Pj^(T^^^  (x) )  =  P  j^(x)  . 

d 

However,  it  can  be  checked  that  Pj^(x)  =  x  when 
written  in  canonical  form.  Therefore 

T^^^  (x)  =  P  i,(x)  .  (III. 5) 

d 

This  is  a  very  remarkable  result,  because  it  explicitly 
states  that  the  iterates  of  any  polynomial  (in  canonical 
form)  are  subsets  of  the  family  of  orthogonal  polynomials 
with  respect  to  the  equilibrium  measure  associated  to  the 
Julia  set  corresponding  to  this  polynomial.  While  the 
iteration  of  polynomials  is  a  very  complicated  nonlinear 
operation,  orthogonal  polynomials  satisfy  a  three-term  linear 
recursive  relation  [20] : 

=  (x-a^)P^(x)  -  R^P^_i(x).  (III. 6) 

It  is  not  difficult  to  obtain  the  and  explicitly  in 

terms  of  the  coefficients  a2»a2/.*.,a^  [19].  Therefore  the 

nonlinear  substitution  of  a  polynomial  into  a  polynomial 
has  been  changed  into  a  linear  operation.  However,  the 
linear  relation  (III. 6)  involves  all  the  interpolating  poly¬ 
nomials  between  the  polynomials  of  the  subfamily  of  degree 

d^  which  are  the  kth  iterates  of  T(x).  Nevertheless  extremely 
interesting  new  points  of  viev/  can  be  derived  from  (II  1. 6). 

Introducing  the  Jacobi  matrix  H  associated  to  the 
three-term  recursive  relation  (III. 6)  as  well  as  a  decima¬ 
tion  operator  D,  acting  on  the  infinite-dimensional  vector 
with  components 

(III. 7) 

where  P  (x)  are  the  set  of  orthonormal i zed  polynomials,  one 
n 

gets  [21,  22] 


(III. 4) 

T(x)  is 


HD  =  D(T(H)  ) 


(III. 8) 


where  D  is  defined  by 
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(III. 9) 


and 


D^n(x)  = 


(X) 


HiKx)  =  xti^(x)  .  (III. 10) 

(III. 10)  expresses  nothing  other  than  the  content  of  the 
three-term  recursive  relation  (III. 6). 

However,  if  (f)  is  an  eigenstate  of  H  with  eigenvalue 
X,  (III. 8)  tells  us  that  is  an  eigenstate  of  H  with 
eigenvalue  T  (x) ,  and  therefore  the  spectrum  of  H  is  invariant 
under  T(x).  That  is,  it  is  the  Julia  set- associated  with 
T  (x)  .  It  can  be  sho\^m  also  that  the  spectrum  is  invariant 

under  the  inverse  map  T  ^(x). 

To  conclude  this  first  part,  we  see  that,  to  any  poly¬ 
nomial  map,  one  can  associate  a  Hilbert  space  operator  the 
spectrum  of  which  is  the  Julia  set  correponding  to  this  map, 
and  whose  eigenstate  is  an  infinite  dimensional  vector  the 
components  of  which  are  nothing  but  the  set  of  orthogonal 
polynomials  with  respect  to  the  equilibrium  measure  defined 
on  the  Julia  set. 

IV.  ONE-DIMENSIONAL  SCHRODINGER  OPERATOR  ASSOCIATED  TO 
A  POLYNOMIAL  TRANSFORMATION.  For  the  sake  of  simplicity  we 
shall  confine  ourselves  to  the  logistic  map 

T(x)  =  x^  -  1  .  (IV. 1) 


We  require  A  >  2  for  the  Julia  set  to  be  real.  In  that 
case  the  Jacobi  matrix  H  is  an  infinite  tridiagonal  matrix 
with  all  elements  zero,  except 

”j,j+l  '  “j+l,j  =  ^  0,1,2,...  (IV.2) 

and  the  Rj^(X)  are  rational  functions  of  X  given  by  the 
recurrence  [17] 


Rq  =  0 


^2n  ^2n+l  ^ 

^2n^2n-l  "  \ 


(IV.3) 


which  fits  for  the  first  few: 


^4  =  1^' 


^  X^  -  X-  1  . 
^5  X-1 


=  A;  R2  =  1;  R3  =  A-1; 


The  Schrodinger  operator  associated  to  the  Jacobi  matrix 
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H,  has  the  following  properties  [21] 

(i)  Its  spectrum  is  invariant  under  both 

T(x)  =  -  X  (IV. 4) 

and  its  two  inverses 

t“^(x)  =  ±/x  +  A.  (IV.  5) 

(ii)  The  integrated  density  of  states  is  the  equilibrium 
measure  dp (x) . 

(iii)  When  X  >  2,  the  spectrum  of  H  is  the  set  K  of  points 
X  (a)  where 


0  (ao/aj_, . . .  ,0^/ . . . ) 


0.  =  ±1 


x(a)  -  0Q  + 


X  +  ^X  +  02  •  •  * 

(iv)  K  is  a  Cantor  set  of  Lebesgue  measure  zero  [14] . 


(IV. 6) 
(IV. 7) 


(v)  The  representation  (IV. 6),  (IV.7)  is  a  well  adapted  coding 

of  K  and  the  action  of  T  is  expressed  on  the  sequences  of 

signs  a  as  the  usual  shift  S 


S(0O'0j^^***^  ~  (0j^f02f 

T (x (a) )  =  X (Sa) - 
-1 


.) 


(IV. 8) 


Similarly  for  the  action  of  T 


0^  “  (ilf0Q^02^/-**) 

T“^(x(a))  =  x(a^) 


(IV. 9) 


Using  the  coding,  one  can  identify  the  measure  dp (x)  as  the 
coin-tossing  probability  measure 


/f(x)dp(x)  =  /  IT  [da^  2  [^  (0jj“l)  6  (a^+1)  ]  }f  (x  (Oq  ,  0j^ ,  02  f  •  •  • ) ) 

n=0 

(IV. 10) 


Therefore  we  see  that  dp  has  no  atomic  part  and  the  action 

of  T  on  the  spectrum  has  the  ergodic  properties  of  a  Bernoulli 

shift. 
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V.  ALMOST  PERIODICITY  AND  BEHAVIOR  OF  THE  EIGENSTATE. 
We  come  now  to  one  of  the  most  remarkable  and  unexpected 
facts  about  the  coefficients  appearing  in  the  Jacobi 

matrix  H.  A  careful  analysis  of  the  recursion  relations 
(IV. 3)  allows  one  to  prove  [17]  that  for  X  >  2 


0  '  ''2n  5  1 
A“1  5  I^2n+1  ^ 

lim  R  ,  ~  • 

p2^+s  ® 


(V.l) 


Also,,  that  for  X  3, 

|R 


,k  "  ^sl  -  -  ^,k  ' 

p2  +s  (X“2) 


(V.2) 


which  shows  that  the  sequence  R^  is  almost  periodic 
Therefore  one  can  expand  R^  in  Fourier-like  series 


R 

n 


r 

p»q 


exp 


2imT  (2p-fl) 
2^ 


[22]  . 


(V.3) 


Those  properties  can  be  extended  to  complex  values  of  X 
large  enough  and  for  X  real  and  slightly  bigger  than  2  [23] . 
Those  quasi-per iodic  properties  also  extend  to  the  most 
general  polynomial  T(x)  of  degree  d. 

Let  us  end  this  section  by  mentioning  some  properties 
of  the  states  ip  (x)  . 

(i)  Outside  the  spectrum,  increases  exponentially, 

because  \p  ,(x)=ip  (T^^^x))  and  T^^^(x)  goes  to  infinity 
k  n2'^ 

as  when  x  is  outside  the  spectrum. 

(ii)  Inside  the  spectrum  one  gets  the  bound 

\ip  (X)  I  <  U  for  2^"^  <  n  <  2^.  (V.4) 

^  ^  V  »/I+4X-l  ' 


Therefore  we  have  an  explicit  polynomial  bound.  The  Lyapunov 
exponent  [24]  can  be  proven  to  satisfy 

2y  (x)  =  Y  (T  (x)  )  ,  (V.  5) 

which  proves  that  y(x)  vanishes  on  the  spectrum,  consistent 
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with  the  bound  (V.4). 

Finally,  let  us  emphasize  the  following  fact.  On  the 
spectrum  we  have 

ij;  .(x)  =  ij;  (T  (x)  )  .  {V.6) 

p2^  P 

fkl 

However,  when  k  goes  to  infinity,  the  sequence  T  (x)  is 
ergodic  on  the  spectrum.  Therefore  the  sequences  \p  ,  (x) 

p2^ 

have  fully  chaotic  behavior  of  a  precise  type  related  to 
the  Bernoulli  shift  mentioned  above.  Although  the  complete 
behavior  remains  to  be  analyzed  [25] ,  and  it  should  not  be 
excluded  a  priori  that  the  chaotic  behavior  could  be  attri- 

buted  to  the  sampling  (p2  ) ,  we  assert  that  ours  is  the  only 
almost  periodic  discrete  model  in  which  such  an  explicit 
statement  on  the  states  can  be  made  when  the  spectrum  is 
singular  continuous. 

VI .  CONCLUSION .  A  most  fascinating  point  is  to  study 
the  time  dependent  behavior  of  the  corresponding  quantum 
mechanical  system 

ilj(t)  =  e^”S(0)  ,  (VI.l) 

choosing  as  ij;{0)  the  vector  (1,0,0,...)  =  |0>.  The  projec¬ 
tion  of  ij;  (t)  on  ij;(0): 

<i|;  (0)  (0)  >  =  /e^^^dy(x).  (VI. 2) 

Therefore  the  Fourier  transform  of  the  measure  which  for 
large  time  analyzes  the  structure  of  the  fractal  on  which  it 
is  defined  [26]  is  likely  to  give  a  sequence  of  increasing 
times  tj^, t2, t^f  .  .  .t^/ .  .  .  for  which,  no  matter  how  large  t 

is  chosen,  the  probability  that  the  system  is  found  in  its 
initial  state  remains  finite  [27] .  Such  singular  spectra 
will  provide  systems  which  are  intermediate  between  bound 
systems  and  unbound  systems. 
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LARGE  DEFORMATIONS  OF  ELASTOMER  CYLINDERS  SUBJECTED  TO  END 
THRUST  AND  PROBE  PENETRATION 


A.  R.  Johnson*,  C.  J.  Quigley*,  and  I.  Fried** 

Army  Materials  and  Mechanics  Research  Center  (AMMRC) 

(AMXMR  -  SMM) 

Watertown,  MA  02172  -  2719 


ABSTRACT.  The  Army  is  currently  evaluating  new  elastomer  tank  track 
pads.  These  pads  are  repeatedly  loaded  to  large  strains  when  the  tank 
is  traveling  on  paved  highways  and  to  even  larger  strains  when  the 
tank  is  off  the  road  and  the  pad  is  penetrated  by  sharp  objects  in 
the  soil.  In  the  process  of  evaluating  the  new  elastomers,  laboratory 
tests  are  made  of  cylindrical  samples.  In  this  effort  a  numerical 
method  for  analyzing  tests  of  the  cylindrical  samples  is  presented. 
Axisymmetric  triangular  finite  elements  are  used  in  the  analysis  to 
discretize  the  potential  energy  of  the  elastomer.  Gradient  and 
tangent  matrices  for  the  discretized  potenetial  energy  are  computed. 
The  formulation  allows  different  nonlinear  energy  density  functionals 
to  be  used.  Both  end  thrust  and  probe  penetration  problems  are 
solved.  A  penalty  method  is  used  for  the  (no  friction)  contact 
problem  associated  with  the  probe  penetration.  Contours  of  the 
principal  stretch  ratios  and  stresses  are  shown  on  the  deformed 
meshes  for  both  the  end  thrust  and  probe  penetration  problems. 


I NTRODUCTI ON .  The  analysis  of  large  deformations  of  elastomers 
involves  joining  currently  active  research  work  in  numerical  analysis 
and  nonlinear  mechanics [1-5] .  In  this  effort  we  demonstrate  ^  new 
numerical  method  for  analyzing  the  large  deformations  of  axisymmetric 
elastomers.  The  geometrical  relations  necessary  for  describing  the 
stretch  ratios  of  the  deformed  body  in  terms  of  the  undeformed 
geometry  are  given.  Both  the  deformed  and  undeformed  bodies  are 
interpolated  using  the  finite  element  method  and  relationships 
between  these  interpolations  are  used  to  obtain  the  approximations  to 
the  variables  which  describe  the  stretch  ratios.  The  internal  energy 
of  the  deformed  elastomer  is  expressed  as  the  sum  of  separate 
symmetrical  functions  of  the  principal  stretch  ratios,  see  Valanis 
and  Landel  [6]  .  The  internal  energy  is  then  used  to  determine  the 
potential  energy  of  the  deformed  elastomer.  The  energy  gradient  and 
tangent  matrices  with  respect  to  the  finite  element  nodal  variables 
are  then  determined  so  that  the  Newton  -  Raphson  method  may  be 
employed  to  obtain  the  deformed  geometry  (i.e.,  a  minimum  of  the 
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potential  energy  functional).  The  incompressibility  constraint  is 
enforced  by  adding  the  work  done  by  changing  the  element's  volume  to 
the  potential  energy.  End  thrust  is  analyzed  by  enforcing  the 
displacement  of  the  top  boundary.  The  probe  penetration  problem  is 
approximated  by  adding  a  penalty  term  to  the  potential  energy  in 

which  the  distance  between  the  contact  surface  (probe  surface)  and 
the  nodes  which  penetrate  into  the  contact  region  is  minimized  [5]. 


STRETCH  RATIOS  IN  AXISYMMETRIC  SOLIDS.  The  strain  energy  in  a 
deformed  elastomer  solid  can  be  determined  in  terms  of  the  changes  in 
length  of  material  lines  originating  in  the  undeformed  body  [8].  The 
measure  of  this  change  in  length  most  commonly  used  is  the  stretch 
ratio#  defined  as  the  ratio  of  the  deformed  ‘material  line  length  to 
the  undeformed  length.  The  stretch  ratios  in  an  axisymmetric  solid 
can  be  determined  as  follows,  see  Figure  1.  Let  the  coordinate  system 
of  the  undeformed  body  be  the  (a,  6)  system  and  the  system  of  the 
deformed  body  be  the  (r,z)  system  where  the  radial  direction  is  given 
by  the  a#r  coordinates  and  the  axial  direction  by  the  B,z 
coordinates.  Neighboring  points  in  the  undeformed  body  are  located  by 
the  coordinates  (a, 6)  and  (a  +  da,$  +  d$)  respectively.  Deforming  the 
elastomer  results  in  the  above  points  being  mapped  to  (r,z)  and  ( r  _+ 
dr#z  _+  dz)  #  respectively#  in  the  deformed  body.  The  line  segments  6. 
and  represent  the  undeformed  and  associated  deformed  material 

lines  ror  the  direction  e  at  (o,B)  in  the  _undeformed  body.  If  we 
let  s  be  a  coordinate  line  measured  along  and  assume  that  the 

(r#2)  coordinates  along  can  be  described  by  functions  of  s  then  we 
can  write  and  6^  as  follows. 

6j^(e)  =  doo  +  dBB  =  (cose  a  +  sine  B)  ds 

6„(e)  =  dra  +  dzB  =  (r  a  +  z  ^)  ds  (1) 

2  s  s 

We  can  now  determine  the  stretch  ratio  at  (a#B)  in  the  direction  e. 
It  is 

x(e)  =  ITjl  /  |TJ  •  (  '2> 


To  develop  the  finite  element#  we  determine  the  principal  stretch 
ratios  in  terms  of  the  deformed  and  undeformed  coordinates.  We  have 
the  stretch  ratio  at  (a#B)  in  direction  6  using  the  deformed 
coordinates  and  the  auxiliary  system  (9#s).  We  map  the  (9#s) 
coordinates  to  the  (r#z)  coordinates  as  follows#  for  a  given  9. 
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r  =  r  {oi(s)  ,  e  (s)  ) 
z  =  z (a (s) , 6 (s) ) 


(3) 


Using  (2)  and  the  chain  rule  we  have 


X 


2 


(r) 


r  2  ^  2  1 

- 

r  +  z  r  r.  +  z  z„ 

a  0  0  B  a  B 

cose 

9  9 

rr  +  zz  r^  +  z'* 

sine 

a  B  0  B  B  B  _ 

(4) 


2 

The  principal  values  of  x  (0)  are  the  characteristic  values  of  the 
matrix  in  (4) .  They  are 


=  1/2 (A  +  B  +  ((A  -  B)^  + 
=  1/2(A  +  B  -  ((A  -  B)^  + 

where 


A 

B 


and 


+ 

+ 


C  =  r  r„ 
a  e 


+  z  z„ 
a  0 


(5) 


The  third  stretch  ratio,  the  hoop  stretch  ratio,  is  given  by 


22,2 

X3  =  r  /a  (6) 

TRIANGULAR  BILINEAR  ELEMENT,  The  potential  energy  expression  for  a 
deformed  elastomer  is  not  quadratic  in  the  variables  to  be 
determined.  That  is,  the  potential  energy  minimization  problem  is 
nonlinear.  In  this  section  we  describe  a  convenient  way  to  compute 
the  gradient  and  tangent  matrices  which  must  be  repetively  calculated 
in  the  minimization  process.  We  choose  a  triangular  three  node 
element.  The  node  numbering  and  coordinates  for  the  element  are  shown 
in  Figure  2.  We  map  the  undeforraed  triangular  element  to  the  unit 
triangle  shown  in  Figure  3  and  interpolate  (a,P)  over  the  mapped 
triangle  as  follows. 
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o  =  (1  “  C  “  n)  +  “2^  ^2^ 
S  =  -  5  -  n)  +  32?  +  63II 

Using  (7)  we  obtain 


(7) 


dC 

1 

-asj 

da2  - 

da' 

dn 

A 

-dBj 

dot- 

dB 

where 

&  =  da^  dB^  -  da^  dB2 

and 


doj^  =02-012 

daj  "  "1  -  «3 

doj^ 

+  daj 

+  do 

dSi  "83-82 

d82  "  -  ^3 

d3. 

+  d82 

+  dB 

Similarly  we  interpolate  (r,z) 


(8) 


r  =  rj^d  -  ?  -  n)  +  r2?  +  r^  n 
z  =  z^(l  -  5  -  n)  +  Z2?  +  z^n 

from  which 


dr 

'^■^3 

-dr2l 

(35 

dz 

dz^ 

N 

1 

dn 

(9) 


(10) 


Using  (8)  and  (10)  we  can  obtain  r^,  r  ,  z  ,  and  z  in  terms  of  the 
nodal  variables  (r^^,  z^,  z^,  z^J  and  the  undeformed  geometry. 
We  find 
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i  *  1,2,3  (11) 


r 

a 


1 

r«  = - Sr.de. 

®  A  ^  1 


1  1 

z„  =  -  —  s  z.-  <ae.  Zo  =  “  —  ^  z-  «3a. 

“  A  ^  ^  ®  A  ^  ^ 

We  note  the  relations  in  (11)  can  be  used  in  (5)  and  (6)  to  compute 
the  principal  stretch  ratios  X.  (i=l,2,3).  The  quantities  r  ,  z  , 
z  in  (11)  are  constant  in  an  element.  It  is  useful  to  express 
these  quantities  as  vector  dot  products  since  they  are  needed  to 
calculate  the  potential  energy  and  thus  will  be  involved  in  the 
computation  of  the  gradient  and  tangent  matrices.  We  define 


p*^  =  -1/A  [dBj^  0  dB2  0  dB^  0] 
q"^  =  -1/A  (0  dBj^  0  dB2  0  dB3] 
r"^  =  1/A  [do^  0  dci2  0  ^“3  03 
s*^  =  1/A  [0  doj^  0  da2  ® 

then 

T 

r  =  u  p 
a  ^ 

T 

rp  =  u  r 
where 

T 

u  =  (rj^,Zj^,r2,Z2,r3,Z3) 


T 

z  =  u  s 
a 

T 

Zg  =  u  s 


(12) 


(13) 


In  addition,  we  will  need  a  similar  expression  for  the  radius  used  to 
calculate  X3  in  (6).  Since  the  geometry  (i.e.  r  and  z)  are 
interpolated  with  bilinear  functions  we  will  attempt  to  use  one  point 
integration  to  compute  the  expressions  in  the  energy.  The  form  of  r 
at  the  center  of  the  element,  r^,  becomes 


where 


t*^  =  1/3  [1  .0  1  0  1  0] 


We  now  outline  the  computation  of  the  gradient  and  tangent  matrices 
of  the  potential  energy.  From  the  Valanis  and  Landel  form  [6]  of  the 
internal  energy  we  have 

n  =  U(Xj^,X2,X3)  dx  -  W  (15) 
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where 


U(X^,X2,X^)  =  z  F{X.) 

T  *  the  volume  of  an  element 
W  =  the  work  done  by  external  forces 

Using  one  point  integration  we  have 


E  =  irr  AU  -  W 
c 


(16) 


Then  the  gradient  and  tangent  matrices  become 


an 


au  aw 


g  =  —  =  irr  A  — - ^ 

au  3u^  au 


(17) 


and 


a^n 


a^u 


2 

aw 


k  = 


auau 


T*  "c* 


T  T 

3u3u  3u3u 


(18) 


Since  Xj^  =  X^(r^,z^,r^,z^,r^,z^)  =  X^(u)  for  an  element  we  can  then 
write 


u  =  U(u) 


(19) 


and  directly  compute  the  gradient  of  0  as  follows, 


au 


au 


=  •=  z  u .  X .  , 

T  1  I'u 

i=l 


(20) 


where 

U.  =  U, 


xi 
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and 


(21) 


and 


Note,  ^3  uu  “  ®  there  are  eight  coefficient  matrices  in  (21) . 
Using  (5j,(6)  and  (14)  we  have 
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..  =  X.  ^k,  +  X.  „B,  +  X.  „C, 

l,u  1,A  'u  1,B  'u  1,C  'u 


i  =  1.2 


X_  -  t/ a 
3,u 


Similarly, 


^i,uu  ~  ^i,A*'uu  ^  ^i,B®'uu  ^i,C^'uu  ^i,AA^'u^'u  * 

*  h.CC^’u^’J  *  ‘i,AB<'''u®'u^  * 

^i,AC<»'uC.u''  *  ='u*'u'''  ^  ‘l,BC<®'u='u’'  ^  ='u®'u’'' 
i  =  1,2  ( 


X,  =0 
3,uu 


The  expressions  in  (22)  and  (23)  can  be  conveniently  computed  using 
the  following  relations  obtained  from  definitions  above. 

T  T  T 

A  =  u  [pp  +  qq  ] u 
T  T  T 

B  =  u  [rr  +  ss  ]u  (24) 

m  fp  »p 

C  =  u  [pr  +  rp  ] u 
Then , 


A,  =  2[rp+zq] 
^'uu  '  ^tpp’’’  +  qq’’] 


B,  =2 [r„ r  +  2„  s] 
u  6  6 

T  T 

B,uu  =  2[rr  +  ss  1 


^'ll  "" 

UGtpap 

^"UU  “  *^P  ■*■  +  sq 

The  only  remaining  tasks  are  to  determine  the  derivatives  of  the 
stretch  ratios  and  to  perform  the  axisymmetric  integration  of  the 
element  gradient  and  tangent  matrices.  Using  (5)  we  have 


A  -  B 

X  2  .  ,  2 

1  2 


;  nh,A 
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c 


2 ,  A 


(27) 


‘2  ,B 


2,C 


^1^1, A 


^1^1, C 


1,AA 


■1,AB 


‘1,AC 


‘2,AA 


1 

-  X 

1 

X, 


!  -  x/)  -  2(A  -  B)(X^X^_^  - 

4  (Xi^  - 


-  X 


1,A 


-HjA 

2X^2 


2C(X2X2  ^  -  ^i^i  j^) 

(X,2  .  x,2)2' 


^1, A^1,C 


'2  ,  AB 


'2,  AC 


1,BB 


‘1,BC 


■2,BC 


^  ^1,A^  ^2,A^  ^1^1, AA  ^ 

2 


^  ^1,A^1,B  ^2,A^2,B  ^1^1, AB  ^ 


^  ^  ^1,A^1,C  ^2,A^2,C  *  ^1^1, AC  ^ 

^2 


^^1,A^1,B  ^  ^1,B  ^1^1, AB^ 

^1 


^^1,A^1,C  ^1,B^1,C  ^  W,AC  ^ 


^^1,B^1,C  ^2,B^2,C  ^  ^1^1, BC  ^ 

^2 
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'1,CC 


2  2 

-  Xo  )  -  2C(X,X^  ''o-'o 

_ ^ _ -L  X  y  ^  ^  ^  C 


(X^2  .  ,^2)2 


-  X^X„  _) 


-  X 


and 


^2,CC  .  f  ^2,C^  *  ^1^1, CC  ^ 


2  2 

Note,  x^  IS  computed  using  X^  "''  ^2  =- A  +  B 

so  not  needed.  That  is, 

^  /  t5iL5 


IfC 


2 ,  uu 


T  T 

—  (A  +  B  )  -  3^,  3^,  - 

2  ,uu  ,uu  l,ul,u  2,u2,u 


(28) 


Given  any  configuration  (r,z)  we  can  now  compute  the  approximate 
gradient  and  tangent  matrices  of  the  potential  energy  using  the  above 
bilinear  three  noded  triangular  axisymmetric  element.  The  Newton  - 
Raphson  method  can  then  be  used  to  locate  extremum  values  of  the 
potential  energy. 


MATERIAL  MODEL.  The  internal  energy  expression  used  in  this  effort 
is  valid  for  a  nearly  incompressible  solid  undergoing  large 
deformations  [6].  Elastomers  fall  into  this  category.  The  specific 
form  used  is 


U 


A 


1/2  X 


[ln(  ^2  ^3^1 


3 

+  2y  Z  X  .  rin(X  .) 
i=l  ^ 


1] 


(29) 


A  Jf 

where  X,  y  =  the  Lame  constants. 

Many  other  forms  are  available  [7],  We  utilize  the  expression  in 
(29)  to  demonstrate  the  finite  element  algorithm.  The  terms  needed  in 
the  gradient  and  tangent  matrices,  equations  (19)  and  (20),  are 


U. 


X 

—  ln(Xj^X2X2) 
^  i 


+  2y  ln(X.) 


i  =  l,2,3 


1 

— 2  ^  X  [1  -  ln(Xj^X2l2^1  2yXj^^  i  =  l,2,3 


(30) 


and 
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i-1,2  j=2,3  j>i 


A 

X 


X  .  X  . 
1  3 


END  THRUST  AND  PROBE  PENETRATION.  To  demonstrate  the  above  algorithm 
we  solved  two  problems.  The  first  is  the  "end  thrust"  problem  in 
which  a  cylinder  is  compressed  by  first  bonding  its  ends  to  plates. 
The  cylinder  and  plates  are  then  placed  in  a  hydraulic 
tensile/compression  tester  which  forces  the  plates  towards  each 
other.  The  second  is  the  "probe  penetration"  problem  which  involves 
forcing  a  probe  with  a  hemispherical  end  down  the  axis  of  the 
cylinder . 

A.  END  THRUST.  A  cylinder  of  radius  1.0  in  and  height  2.0  in  was 
assumed.  The  Lame  constants  chosen  were  X  *  16,000  psi  and  u  “  160 
psi.  These  values  closely  represent  styrene  -  butadiene 
rubber.  Solutions  were  obtained  by  enforcing  motion  of  the  top 
surface  of  the  cylinder.  Results  are  shown  in  Figure  4  for  a  10% 
reduction  in  height.  The  deformed  mesh  and  profiles  of  ^3 

are  shown. 


B.  PROBE  PENETRATION.  For  this  example  a  cylinder  of  radius  1.0  in 
and  height  1.0  in  were  assumed.  The  values  of  x  and  w  were  the  same 
as  those  used  for  the  end  thrust  problem.  Some  additional  comments 
relating  to  the  method  used  to  represent  contact  are  worthwhile.  A 
penalty  method,  in  which  the  distance  between  nodes  (which  have  moved 
"inside"  the  probe  surface)  and  the  probe  surface  is  minimized,  was 
used  to  model  contact.  One  of  two  terms  was  added  to  the  potential 
energy,  depending  on  the  location  of  the  node.  The  first  term  (see 
Figure  4.)  is 


n  ' 

n 


where 


(31) 


Y  =  p  k  , 

np  znzn 

p  =  a  user  defined  constant  to  weight  the  penalty  term, 

k  =  the  diagonal  term  of  the  tangent  matrix  associated 

2hzn  ^,ith  a  vertical  displacement, 

(r  ,z  )  =  the  coordinates  of  the  node, 
n'  n 

c  =  the  height  to  the  center  of  the  hemispherical  end  of 
the  probe. 


and 

r  =  the  radius  of  the  hemispherical  end  of  the  probe. 
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The  second  term  is 


II 


n 


n 


(32) 


The  n  '  term  is  used  if  the  node  is  inside  the  spherical  end  of  the 
probe" and  the  n  ”  term  for  the  case  when  the  node  is  inside  the 
cylindrical  portion  of  the  probe. 

I  II 

The  gradient  and  tangent  matrices  were  computed  for  and  and 

used  to  modify  the  global  gradient  and  tangent  matrices  when  a  node 
was  in  contact.  After  a  node  was  determined  to  be  in  contact  the 
appropriate  penalty  term  was  applied  for  all  succeeding  probe 
locations  analyzed.  That  is,  the  node  was  not  released.  However,  none 
of  the  nodes  in  contact  moved  outside  the  contact  surface  for  the 
problems  analyzed  here.  The  results  for  a  probe  of  radius  0.25  in 
penetrating  0.30  in  into  the  cylinder  is  shown  in  Figure  5.  Also 
shown  are  results  obtained  using  the  ABAQUS  program  with  a  Mooney  - 
Rivlin  material  given  by 


U 


2  2  -2 
80psi  ^2  ^1  ^2 

2  2 

+  ~ 


3]  +  20  psi 


-2 


(33) 


The  results  shown  in  Figure  5  indicate  that  there  is  relatively  good 
agreement  between  the  ABAQUS  calculations  and  the  calculations  made 
using  the  above  finite  element  algorithm.  Overall  displacement 
profiles  and  the  location  of  zero  hydrostatic  pressure  (i.e. 
approximate  zone  of  near  zero  volume  change)  agree  well. 


CONCLUSION.  A  finite  element  algorithm  was  presented  in  which  large 
axisymmetric  deformations  of  nearly  incompressible  materials  can  be 
determined.  Results  were  presented  for  end  thrust  and  probe 
penetration  problems.  Different  material  models  can  be  accommodated 
in  this  formulation  with  relative  ease.  The  computer  program  written 
to  implement  the  above  finite  element  algorithm  was  checked  by 
comparing  results  obtained  using  it  to  results  obtained  using  the 
ABAQUS  finite  element  code  for  a  probe  penetration  problem. 
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Figuxc  1*  Notfttlon  for  material  line  deformatloas. 


Figure  2.  Node  nunbering  and  coordinates  for  element. 


Fignre  3.  Unit  triangle  used  for  Interpolation 
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ABAQUS 


r  -  displacement  profiles 


Figure  5.  fjobe  Penetration:  Defomied  uesh  at  35%  probe  penetration. 

P  oflles  for  r,z  displacements  and  hydrostatic  pressure, 
(continued  next  page) 
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ABAQUS 

hydrostatic  pressure  profiles 


Figure  5.  Continued. 


A  TECHNIQUE  FOR  CALCULATING  PATH  INTEGRALS 
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I,  INTRODUCTION,  Path  independent  integrals  for  fracture  mechanics 
were  presented  by  Eshelby  [5]  and  Rice  [7] ,  and  a  number  of  papers  extend¬ 
ing  the  range  of  application,  e.g.  Atluri  [1],  Blackburn  [3],  Blackburn  et 
al  [4],  Hellen  [6]  and  Rice  [8],  have  since  appeared. 

The  present  paper  describes  a  finite  element  method  for  calculating 
approximations  to  a  path  integral  Jp  for  fracture' problemsj  involving 
elastic-plastic  deformation  with  hardening.  This  involves  the  use  of 
techniques  based  on  incremental  plasticity  and  allows  approximations  to 
Jp  to  be  calculated  for  contours  surrounding  the  crack  tip..  Recent  comments 
by  Atluri  and  TraJ^ey  emphasise  the  importance  of  being  able  to  calculate 
such  approximatiofis  for  contours  at  varying  distances  from  a  crack  tip, 
see  Atluri  [2]  and  Tracey  [10], 

II.  Jp-INTEGRAL,  Rice  [7]  proved  that,  for  a  homogeneous  two- 
dimens  ional^ela^tic^ody  ,  the  rate  of  decrease  in  potential  energy  with 
respect  to  crack  length  is  equal  to  the  path  independent  integral,  J. 

That  is 


J 


(2,1) 


where  is  the  potential  energy  and  L  the  crack  length.  J  is  defined 
for  a  crack  with  flat  surfaces  parallel  to  the  Xj^-axis,  sec  Fig.  1,  as 


9u  . 

J  E 

r  1 

Wdx,  -T.  -r-i  ds 

2  1 

(2,2) 


where  the  contour  T  surrounding  the  crack  tip,  starts  from  the  lower 
crack  surface  and  continues  in  an  anticlockwise  direction  until  it  ends 
on  the  upper  surface,  W  is  the  strain  energy  density,  u^  are  the  dis¬ 
placements,  and  the  tractions  are  defined  with  respect  to  the  unit 
outward  normal  vector  n,  so  that  T^^  E  a^jnj. 
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Standardisation  Group,  London,  England . 
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Wolverhampton,  WV4  6BW,  England. 
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Fig.  1.  Two-dimensional  crack  with  contour  V 


For  problems  of  linear  elastic  fracture  the  value  of  J  can  be 
used  as  a  fracture  criterion.  In  the  present  work  for  elastic-plastic 
fracture  problems  a  corresponding  integral,  the  Jp-integral,  is  defined 
This  integral  can  be  developed  from  the  J-integral,  (2,2),  by  separating 
what  was  for  the  elastic  case  the  strain  energy  density  W  into  elastic 
and  plastic  components,  and  Wp  respectively.  Thus 

W  =  W.  +  W  ,  (2.3) 

^  P 

where  the  elastic  component  is  given  in  terms  of  the  stress  and 
elastic  strain  components  by 


The  plastic  term,  Wp,  is  defined  as 

e 

f  ^  - 

We  ode 

P  Jo  P 


(2.4) 


(2.5) 


where  a  and  Cp  are  respectively  the  effective  stress  and  effective 
plastic  strain.  Equations  (2.2)  -  (2.5)  together  define  Jp. 

III.  CALCULATION  OF  THE  Jp-INTEGRAL 

Full  details  of  the  method  of  approximating  Jp  throughout  the 
load  history  for  elastic-plastic  deformation  are  given  by  Whiteman 
and  Thompson  [11];  a  description  of  the  algorithm  and  use  of  the 
MODEL  finite  element  code  appears  in  [9] . 

The  finite  element  method  is  applied  using  a  formulation  in  terms 
of  displacements.  With  incremental  plasticity  the  load  is  applied 
incrementally,  and  for  the  kth  increment  the  finite  element  approximation 
to  the  corresponding  increment  of  displacement  is  first  calculated. 
The  associated  increments  of  strain  and  stress  d^^^  and  d^  ^  are  then 
retrieved  and  the  total  displacements  ,  strains  and  stresses 

^k)  ^  for  the  combined  load  up  to  the  kth  increment,  are  calculated. 
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For  the  approximation  of  as  in  (2.2)  -  (2.5),  to  be  calculated, 
a  contour  path  1,  see  Fig*  2,  is  chosen  through  a  ring  of  elements  x 
surrounding  the  crack  tip.  The  total  value  of  the  approximation  (Jp)^ 
xs  calculated  by  summing  the  contributions  from  elements  involvii 


Fig*  2,  The  contour  T  around  the  crack  tip,  point  0.  The  Gauss  points 
are  indicated  by  X  and  T  by  the  dotted  line*  The  upper  crack 
surface  is  the  line  OA 


:n  onto 

is  either 


( e )  ^  * 

segments  T  of  T.  Each  element  Q  is  transformed  in  turn^opto  a 

standard  element  in  the  (^^ ,^^) -plane  so  that  the  image  of 

a  line  = Const,  a  line  = Const  or  a  path  consisting  of  parts  = Const 

and  =  Const . 

.(e) 


Along  the  image  of  T  ,  for  example  = Const,  we  have  that 

(k) 

1 

J 

.(i)  ,(j). 

p.sses  anc.  , 
e 


.  e.,(k)  _  r  .(e)A  (i) 

\  ^3  V^i  *5,  .  G.  , 

J  ^ 


(3*1) 


where  the  (g^  ,^2  )  Gauss  points  and  the  Gj  are  the  corresponding 

weights.  If  the  stresses  and^displacements  at  the  end  of  the  k^^  load 
increment  are  respectively  and  (uf)]^  ,  i,j  =  1,2, 

is  given  by 


then 


.) 


h 

(k)  _  1 


e.(k)  .  e  .(k)  i^h 

(c^i)h  - -  +  (cr,J, 


'l2'h  V" 
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./  e.  (k) 
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where  is  the  plastic  work  term  at  the  end  of  the  increment 

P  -(k) 

^  ,„(k)  \  P  -(k)  ^  (k) 

given  by  (W^  =  j  a  de^ 


For  nonlinear  fracture  problems,  where  incremental  techniques  are 
used  to  model  the  elastic-plastic  deformation,  if  the  total  load  is 
applied  through  L  increments,  then  approximations  to  (Jp)h  ^  cal¬ 

culated  for  each  load  increment  k  =  l,2,,**,L-  In  each  increment  a 
number  of  approximations  are  calculated  for  different  contours  V  which 
are  at  different  distances  from  the  crack  tip  0,  see  Fig.  2.  These 
approximations,  together  with  their  mean  value,  are  used  as  criteria 
for  fracture. 


Approximations  to  Jp  have  been  calculated  for  a  number  of  contours 
r  as  the  load  is  incremented  for  several  two-dimensional  Mode  I  crack 
problems.  In  each  case  a  power  law  relating  effective  plastic  strain 
and  effective  stress  has  been  employed-  Details  of  the  problems  together 
with  some  numerical  results  are  given  by  Thompson  and  Whiteman  in  [9] 
and  [11], 
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ABSTRACT.  An  a  priori,  explicit  algebraic  procedure  for 

identifying  shear  locking  and  excessive '  solution  stiffening  in 
thin  shear-deformable  plates  is  explored.  Only  element  level 
solutions  of  element  Kirchhoff  modes  are  required  to  establish 
the  nodal  degree-of- freedom  constraints.  These  constraints 
clearly  identify  whatever  kinematic  stiffening  might  exist. 

The  methodology  is  demonstrated  by  the  use  of  a  conforming, 
three-node  Mindlin  element.  Several  discretizations  of  square 
plates  are  examined.  The  results  are  compared,  and  fully 
confirmed,  with  the  corresponding  numerical  solutions.  Examples 
of  alternate  discretizations,  which  alleviate  the  locking  effect, 
are  also  presented. 


0 

1 .  INTRODUCTION.  The  locking  phenomenon,  intrinsic  to  C 
penalty  constraint  models,  is  by  far  the  most  severe  detriment  to 
element  performance.  Encountered  in  the  limiting  penalty  regime 
(viz.,  when  the  penalty  parameter  approaches  infinity  while  the 
penalty  strains  diminish  to  zero),  locking  is  evidenced  by 
grossly  erroneous  results.  Various  remedial  techniques  have  been 
employed  in  an  effort  to  abate  'shear  locking'  in 

shear-deformable  flexure  elements  and  mathematically  similar 
'incompressibility  locking'  in  incompressible  elasticity, 

plasticity  and  fluid  flow  formulations.  Notable  improvements 
have  been  achieved  by  discrete  penalty  constraints  [1-5],  reduced 
integration  procedures  [6-22],  improved  penalty  strain 

interpolations  [23-25],  and  penalty  parameter  modifications 
[13,18,26-30] . 


Despite  the  locking-free  characteristics  generally  achieved 
by  these  enhancing  schemes,  evidence  of  locking  under  certain 
d i scretizational  conditions  still  remains  (e.g.,  refer  to 
[17,29,30]).  Presently,  there  exists  a  lack  of  insight  with 
regard  to  the  actual  mechanism  of  locking,  in  general,  and  in 
these,  occasionally  locking  models,  in  particular.  Attempts  have 
been  made  to  predict  locking  on  the  basis  of  a  constraint  index 
criterion  [10]  and,  recently,  by  operational  procedures  involving 
consistency  comparisons  of  discrete  and  exact  equations  of 
equilibrium/motion  [31,32] .  While  the  former  approach  is  only  a 
heuristic  measure  of  locking,  the  latter  methods,  despite  their 
theoretical  merit,  appear  too  cumbersome  for  practical 
appl ications. 
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The  objective  of  our  present  effort  is  to  explore  in  detail 
the  mechanisms  of  shear  locking  by,  what  appears  to  be,  the  most 
direct  and  effective  way  for  a  discretization  assessment 
conducted  a  priori  to  the  full-scale  finite  element  computation. 
Herein,  we  propose  an  explicit,  element-by-element  analysis  of 
element  Kirchhoff  modes  (i.e.,  vanishing  coefficients  of  shear 
strain  polynomial  terms)  which  are  responsible  for  enforcing  a 
nearly  shearless  element  deformation  state.  Such  an  analysis 
involves  simultaneously  solving  a  small  set  of  linear  algebraic 
constraint  equations  for  each  element,  from  which  kinematic 
constraints  upon  individual  degrees  of  freedom  (dof)  are 
determined.  By  starting  with  elements  most  severly  restrained  by 
boundary  conditions  and  marching  across  the  mesh  from  element  to 
element,  one  can  definitively  identify  the  true  origin  of  locking 
within  an  element  and  the  conditions  of  locking  propagation 
throughout  the  whole  discretizational  domain.  In  the  absence  of 
shear  locking,  clear  evidence  of  a  global  (mesh-wide)  stiffening 
effect,  if  such  exists,  can  be  established. 

This  a  priori  analysis  will  be  demonstrated  on  several  thin 
plate  problems  modeled  by  a  conforming,  three-node  (nine  dof) 
Mindlin  element  [29]  (refer  to  Figs.l  and  2  for  the  plate  and 
element  notations) .  The  element,  herein  referred  to  as  MIN3^, 
utilizes  ' ani sopar ame tr i c '  kinematic  interpolations  and  exact 
quadratures  of  all  energy  and  work  terms  in  the  variational 
statement.  It  also  uses  the  standard  Mindlin  plate  shear 

2  2  2 

correction  factor,  k*  (k*.  =  n  /12  for  isotropic,  homogeneous 

plates  [33])  and,  hence,  constitutes  a  true  penalty  constraint 
element  for  which  the  penalty  parameter  becomes  infinitely  large 
as  its  thickness  diminishes  to  zero.  Essentially  the  same 
element,  termed  MIN3  [29],  with  the  shear  stiffness  (penalty 
parameter)  enhanced  by  the  so-called  element-appropriate  shear 

2  2 

correction  factor,  k  =(k^4i)  ,  is  completely  devoid  of  locking 

2 

(it>  is  an  element  shear  correction  dependent  upon  element 
material  and  geometric  properties)  .  Yet,  it  should  not  be 
regarded  as  a  true  penalty  element  owing  to  the  finite-valued 
upper  bound  of  its  'penalty'  parameter  (for  details  of  this 
concept  and  its  implementation,  refer  to  [28-29]).  For 

comparison  purposes,  MIN3's  results  will  also  be  presented. 

In  section  2,  from  the  kinematic  variable  assumptions  we 
derive  explicit  polynomial  expressions  for  MIN3*'s  transverse 
shear  strains  and  bending  curvatures,  and  define  two  distinct 
types  of  Kirchhoff  modes  that  control  element  performance. 

In  section  3,  for  the  purpose  of  demonstrating  the  typical 
MIN3^  behavior  with  its  occasional  shear  locking  characteristics, 
numerical  results  are  presented  for  several  square  plate  problems 
discretized  by  three  standard  mesh  patterns. 

In  section  4,  the  proposed  explicit  procedure  for  identifying 
shear  locking  and  related  excessive  plate  stiffening  is 
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elaborated  on  using  standard  discretizational  patterns.  Closed 
form  solutions,  clearly  exhibiting  the  mechanisms  of  mesh 
constraining,  are  carried  out  for  the  basic  macro-element  and 
refined  multi-element  discretizations.  These  a  priori 

predictions  are  confirmed  throughout  by  the  numerical  solutions 
of  section  3.  The  main  shortcomings  of  some  of  the  considered 
meshes  are  overcome  by  alternative  discretizations  along  plate 
boundaries  (section  5) .  Conclusions  and  some  numerical  results 
with  our  trouble-free  MIN3  element  are  given  in  section  6. 

f 

2 .  KIRCHHOFF  CONSTRAINTS.  The  transverse  shear  strains  are 
derived  in  a  consistent  manner  from  the  assumptions  of  the 
transverse  displacement,  w,  and  normal  rotations,  6^  and  e  (the 

interested  reader  should  consult  [29)  for  details  of  the 
derivation  of  w) ,  which  may  be  written  as 


w=rw  +  Le  +Me, 

—  — - —y 

e=c9f  e=ce 
X  -x'  y  -y 


where 

w'’^  =  {w.},  0  =  {e  .}f  9  *  {0  .}  (i  =  1,2,3) 

—  1  —X  XI  — y  y  1  It, 

are  vectors  of  nodal  dof;  the  shape  functions 


(la) 

(lb) 

(2) 


i.  “  tc.},  L  =  {L^},  M=  {M^}  (i  =  1,2,3) 
are  given  in  terms  of  area-parametric  coordinates  as 

Lj  =  1/8 


-  Vi,3'' 


Mj  -  1/8 

"1,3  * 

=  (c^  +  b^x  +  a^y)/2A,  =  1  (summation  on  i) 

(A  is  the  area  of  a  triangle) 


(3) 


aj  .  -  X.,  bj  =  -  y^,  c.  -  x.y^  -  x^y. 

(i  =  1,2,3;  j  =  2,3,1;  k  =  3,1,2)  . 


(4) 


The  transverse  shear  strains  are  computed  using  (1)  from  the 
relations 


we 


w,  +0 
'x  y 

W ,  +0 

'y  x 


(5a) 

(5b) 


After  performing  the  straightforward  algebraic  manipulations, 
arrive  at  the  following  expressions  for  the  shear  strains 
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(6) 


cor r© spend i ng  to  an  arbitrary-shape  triangle: 

’'xz  =  *0  *1  y 

where 


*0  '  ia  !1  * 


e 

-a  -y 


'‘l  =  Ea  *  2, 


e 

-y 


(7a) 

(7b) 


and  components  of  the  1x3  row  vectors  t  ,  l  ,  m  ,  p  and  q  are 
given  as  ® 


tai  =  b./2A. 


‘ai  '  ‘>i<'>k‘=j  - 


Pai  •=  - 


Pai 


a  ./4A. 


(i  =  1,2,3;  j  =  2,3,1;  k  =  3,1,2). 
Similarly,  may  be  expressed  as 

Vz  =  ®1  *• 


wi  th 


(8) 


(9) 


Rfl  =  w  +  JLb  ^x  il!b 
Bi  =  Efa  lx  +  Sb  ly 


(10a) 

(10b) 


where  components  of  the  1x3  row  vectors  ,  rjv,  Pa,  and  q.  are 

given  by  the  geometric  relations  —o  b  -^b 

tfa,  -  a./2A 

ibi  =  ICj  +  Pj'PjPk  - 

"bi  *  =i<=j<'k  -  "k^j*''®''^ 

Pbi  = 

(i=  1,2,3;  j=  2,3,1;  k=  3,1,2). 


(11) 


REMARK  2.1 


_  Interestingly,  the  shear  strain  approximations  (6) 

and  (9)  satisfy  the  following  three  differential  equations: 

+  Y._.„=0  (12) 


^xz'x“®' 


Y  f  =0  r 

’yz"y 


A-fA 


This  means  that  the  edge  constraint  procedure,  used  to  condense 
out  the  mid-edge  w  dof  [29],  is  equivalent  to  enforcing 
differential  constraints  (12)  across  the  entire  element  domain. 
Further,  the  third  equation  in  (12)  may  be  recognized  as  an  exact 
transverse  shear  equilibrium  statement  for  a  plate  without 
interior  loads.  This  ,of  course,  is  always  the  case  for  MIN3*, 
where  any  applied  load  is  transformed  into  consistent  nodal 
forces.  Thus,  our  'displacement'  element  also  satisfies  one 
plate  equilibrium  equation  exactly. 

In  the  thin  (classical)  plate  regime  as  the  plate 
span-to- thickness  ratio  (L/h)  approaches  infinity,  the  Kirchhoff 
constraints 


Y  ■*•0  and  Y  (13a) 

xz  yz 

2 

are  enforced  over  the  entare  element  domain  assuming  that  A/h 
as  well  (i.e.,  the  element  penalty  parameter,  which  is 

2 

proportional  to  A/h  ,  must  be  large) .  Owing  to  (6,9) ,  the 
Kirchhoff  modes  must  be  enforced  simultaneously,  i.e.. 


Aj^+0,  (note  Aj^=-Bj^) 


(13b) 


The  single  important  feature  of  constraints  (13b)  is  that  in 
their  unrestrained  form  (i.e.,  prior  to  applying  boundary 
restraints)  each  mode  contains  contributions  of  at  least  two 
independent  kinematic  variables  (i.e.,  Ap,  and  Bg  are  functions 


of  w,  e  and  6 


depends  on  6  and 


Such  modes  will  be 


X  y'  1  X  y 

referred  to  as  the  'true'  Kirchhoff  (or  penalty)  modes.  Upon 
their  enforcement,  a  dependence  of  at  least  two  originally 
independent  fields  is  achieved.  On  the  other  hand,  we  shall  term 
a  Kirchhoff  mode  'spurious'  if  it  is  only  in  terms  of  a  single 
kinematic  variable.  This  latter  situation  is 
isoparametric  formulations  (i.e.,  where  w,  0^  and  0^ 

interpolated  by  the  same  order  polynomials;  refer 


inherent 

are 


in 


for 

with 


to  (29] 

further  discussion  on  the  subject)  ,  and  it  is  also  possible 
MIN3^  for  some  geometries  and  boundary  restraints  (due  to  the 
removal  of  certain  dof  from  A.,  B-  coefficients). 


In  order  to  ascertain  the  Kirchhoff  constraint  implications 
upon  the  bending  strain  energy,  it  will  prove  insightful  to 
examine  the  element  normal  and  twisting  curvatures: 


K  =  6  ,  *  b . 0  . /2A , 

X  y'x  1  yi' 


=  e 


X  y 


=  a . 0  ./2A, 


1  XI 


K  =  0  ,  +  0  ,  =  (b.o  .  +  a.0  .)/2A. 

xy  XX  yy  ixi  lyi 

(summation  on  i  is  implied,  i*l,2,3) 


(14a) 

(14b) 

(14c) 


Note  that  the  element  bending  strain  energy  is  a  function  of  the 
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curvatures  (14),  while  the  transverse  shear  energy  is  in  terras  of 
the  shear  strains  (5).  The  two  contributions  are  super iraposed  to 
produce  the  total  element  strain  energy  (for  details  on  the 
strain  energy,  stiffness  matrices  and  load  vectors  for  MIN3* 
consult  [29] )  . 

3.  ST?^NDARD  DISCRETIZATIONS  FOR  SQUARE  PLATES.  To 

ascertain  the  kind  of  behavior  MIN3^  typically  exhibits, 
solutions  for  uniformly  loaded,  simply  supported  and  clamped 
square  plate  ranging  from  moderately  thick  (L/h=10)  to  extremely 

thin  (L/h=10^)  are  carried  out.  The  A,  B  and  C  meshes,  depicted 
in  Figs.  3  and  4,  are  standard  4x4  triangular  element 
subdivisions  for  a  symmetric  plate  quadrant.  The  center 
deflection  results,  normalized  with  respect  to  the  appropriate 
Mindlin  theory  solutions,  are  summarized  in  Table  1. 

Note  that  in  the  simply  supported  case  (Fig.  3)  ,  mesh  B  tends 

9 

to  lock  at  L/h=10’,  with  the  solution  progressively  deteriorating 
as  L/h  increases.  Meshes  A  and  C,  however,  yield  nonlocking 
results  for  this  problem,  although  mesh  C  solutions  are  superior. 
In  the  clamped  case  (Fig.  4),  both  A  and  B  meshes  lock  for 
2 

L/h>10  ,  while  mesh  C  provides  a  relatively  good  solution  that  is 
only  locally  locked. 

Apparently,  mesh  A  produces  no  locking  in  the  simply 
supported  case,  but  locks  under  clamped  conditions.  This 
indicates  the  influence  of  boundary  restraints  on  shear  locking. 
Further,  from  the  comparison  of  the  A  and  B  mesh  results  in  the 
simply  supported  case,  it  becomes  evident  that  locking  is  also 
mesh  dependent. 

In  what  follows  we  shall  demonstrate  that  while  true 
Kirchhoff  modes  yield  non-locking  solutions,  spurious  ones 
generally  produce  a  considerable  stiffening  effect  and  ,in  some 
cases,  full  element  and/or  plate  locking. 

£.  ANALYTIC  MFSH  ASSESSMENTS.  The  procedure  to  be 
undertaken  involves  simultaneously  solving  three  algebraic 
constraint  equations  (13b)  for  each  element,  from  which  the 
implication  on  the  bending  curvatures  (14)  and  bending  strain 
energy  can  readily  be  ascertained.  By  marching  through  the  entixe 
mesh,  from  element  to  element,  all  constraints  upon  individual 
dof  are  solved  for  without  much  algebraic  difficulty.  As  the 
result,  we  shall  be  able  to  pinpoint  locking  and/or  related 
kinematic  stiffening  of  either  local  (at  some  nodes)  or  global 
(throughout  the  mesh)  nature. 

To  assess  the  suitability  of  meshes  A,  B  and  C  for  thin 
plate-bending  modeling  a  priori  to  the  full-scale  finite  element 
computations,  it  is  first  expedient  to  determine  whether  a 
representative  macro-element  model  (Fig.  5)  is  capable  of  a 
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nonlocking  plate  response.  Then  we  shall  carry  out  the  analysis 
for  larger  discretizations  starting  with  elements  that  are  most 
severely  restrained  by  boundary  conditions.  In  this  way,  even 
relatively  large  discretizations  can  be  examined  quickly  without 
computerizing  the  process.  The  practical  examples  are  presented 
below. 


4.1  Mesh  types  A  and  B. 
model  (consisting  of  two 
quadrant  of  a  square  plate, 
coordinates  in  (8,11)  yield 
bending  curvatures: 


Figure  5a  depicts  a  macro-element 
MIN3*  triangles)  for  a  symmetric 
Incorporating  element  nodal 
s  the  following  Kirchhoff  modes  and 


element  P 


”2  - 

+  (a/2)  ( 

(A0-0)  , 

W3 

"2  *  +  ®X3  -  v  *  Va’*'' 

(B^^0), 

®xl 

-  “xa  -  'y2  ^ 

(Aj^-0)  . 

(15a) 

K  = 
X 

"y2  -  V  =  '“x3-®x2>/=' 

Sy 

=  <«x2  -  *xl  +  V3  -  V2>/^- 

(15b) 

element  Q 

W3  - 

”4  *  =^^'®x3  ■  ®x4  *  ®Yl  *  ®y3’*® 

(A0-0)  , 

W4  “ 

"1  *  +  e^^).0 

(B0-0) , 

«x3 

—  0.  +  6,-6.  ■*■0 
x4  yl  y4 

(A3^-0)  . 

(16a) 

K  - 

X 

<%3  -  "y  '  <’x4  -  %!>/’' 

‘"xy 

'  <®X3  -  *x4  -  "yl  ♦  V4>''“- 

(16b) 

A-type  mesh;  Simply  supported  edges  (1-4)  and  (3-4)  and  boundary 

I£Straij2ts_w^^3^4^^^^^2i^x4^y2^y3i^y4^- 

In  element  Q,  all  but  two  dof  are  fixed,  while  element  P  has 
only  three  active  dof.  With  these  boundary  conditions,  (15)  and 
(16)  become 


w 


2 


K 


X 


-(a/2}e^^,  w^  * 

V  =  "xy 


(these  imply  ®x3'^“®yl^ 

=  9i 


(17a) 

(17b) 


6 

If 


x3 

X 


K 


xy 


(6 


x3 


(laa) 

(18b) 


According  to  our  earlier  definition,  constraints  (17a)  and  (18a) 
should  be  regarded  as  true  Kirchhoff  modes  since  each  relates  two 
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independent  kinematic  fields.  The  relation  (18a)  is  also 
theoretically  correct  due  to  the  intrinsic  symmetry  condition  in 
this  plate  problem.  Obviously,  the  bending  energies  are 
nonvanishing  (because  k  #0  and  k  #0  in  element  P,  and  k  ¥2  in 

A  y  xy 

element  Q)  ,  and  there  exists  no  locking  in  either  of  the 
elements. 


the  excessive  boundary  restraints  (only  three  out 
are  not  fixed)  one  would  intuitively  anticipate 

locking.  Evidently,  due  to  the 


Because  of 
of  twelve  dof 

excessively  stiff  results  or  even 
exclusive  enforcement  of  the  true 
not  lock.  On  the  other  hand,  the 

.4 


due 

Kirchhoff  modes  the  elements 
solutions  for  mesh  A 


do 


about  20-%  error  in  the  maximum 
be  regarded  as  unsatisfactory 


corresponding  to  L/h^l0"  exhibit 
deflection  (Table  1)  ,  which  may 

considering  the  relatively  large  number  of  dof  in  the 
discretization.  (Note  that  the  exact  plate  theory  solution  for 
this  problem  requires  only  few  Fourier  series  terms  to  capture 
the  bulk  of  the  answer  [34]  .) 


To  ascertain  the  source  of  the  'solution  stiffening'  for  this 
problem  we  need  only  consider  (for  the  sake  of  avoiding  any 
additional  algebra)  the  linear  Kirchhoff  modes  (A^=-  B^+0) .  For 

example,  it  may  be  convenient  to  examine  first  the  R  and  S 
■*  n  n 

(n=l,2, . . . ,N)  elements  that  are  adjacent  to  the  kinematic 

symmetry  axis  x  (refer  to  Fig.  6a) .  The  linear  Kirchhoff  modes 

for  these  elements  may  be  written  as 


Usi 

for 


xi(n+l)  xin  yi(n+l) 

e.,  ,,-e.  -e.  +0. 

x] (n+l)  x^n  yin  y^n 

ng  the  symmetry  condition  along 

all  n)  gives 


yj (n+l) 

-«■  0 

the  X  ax 


0 


i  s 


(19) 

(20) 

®xin=®' 

®yi  (n+l)  "^^yj  (n+l) 

and  employing  (19a)  in  (20)  we  obtain 


(19a) 


xj (N+l)  xjN  ”x j  (N-1) * 


■0  .«  +  0  -  0  =  A^0 

X]2  yil  y^l 


(20a) 


The  R^  element  constraints  (19a)  are  certainly  spurious,  and 

they  influence  the  linear  Kirchhoff  constraints  of  elements 

just  short  of  locking;  (20a)  enforces  virtually  identical 
dof  along  j-line,  and  this  obviously  produces  a  severe  stiffening 
effect.  In  a  similar  fashion  we  find  that  the  adjacent  row  of 
elements  is  subject  to  the  same  constraining  action  on  0^j^^  dof. 

Moreover,  following  the  same  procedure  one  could  evaluate 
Kirchhoff  modes  along  the  other  kinematic  symmetry  line  (y-axis) 
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and  arrive  at  stiffening  constraints  of  type  (19a)  ,  i.e.. 


e  .  •*■6  .  ♦e,  ♦...^0 
yin  yjn  ykn 


(21) 


Thus,  constraints  (20a)  and  (21)  are  enforced  throughout  the 
mesh  causing  a  significant  stiffening  effect  (only  few  rotational 
dof  remain  independent  because  of  these  constraints)  . 
upon  examining  our  numerical  solution  for  mesh  A,  we  find  that 

relations  (19a, 20a, 21)  are  in  fact  confirmed  (e,g.,  for  L/h=10 


we  have:  ^ ^ j^=1.00011A,  ^^*1 . 00014  a , 

e  .^  =  1.00015A)  . 

XD  5 


A-type  mesh:  Clamped  edges  (1-4)  and  (3-4) .  It  is  a  very 

simple  matter  to  verify,  using  additional  boundary  restraints 

e  =e  _=0  in  (17)  and  (IS) ,  that  a  single  macro-element  model 
y  X  X  j 

will  produce  a  fully  loc)?ing  solution.  Furthermore,  a 

multi-element  model  of  this  type  will  also  lock 
(simply  apply  the  additional  restraints  ®yj 

(20a)).  The  numerical  solutions  for  mesh  A  (Table  1)  are  seen  to 
be  in  exact  agreement  with  our  closed  form  predictions. 


B-type  mesh:  Simply  supported  edges  (3-4)  and  (2-3)  and  boundary 

iestranvt^2^3^^^^  2^^  3^y 

Employing  these  boundary  restraints  in  (15)  and  (16)  gives 


w^^(a/2)ey2r  0y2-^® 


Wj-(a/2)e,4 


K 

xy 


/a 


(22a) 

(22b) 

(23a) 


•'y  “  ®x4^®'  •'xy 


-®x4/^ 


(23b) 


Clearly,  the  vanishing  6^2  ®x4  spurious,  and 

con seouen tly ,  each  element  in  this  mesh  will  lock  (all  dof  and 
curvatures  vanish,  i.e.,  we  have  a  trivial  solution). 


We  can  also  verify  analytically  whether  a  finer  mesh  of  type 

B  has  a  chance  to  avoid  spurious  locking.  To  do  this,  let  us 

consider  elements  P  and  (n=l , 2 , . . . , N)  along  a  symmetry  axis  x 

n  n 

(refer  to  Fig.  6b).  The  linear  Kirchhoff  modes  for  these 
elements  are 


®xi(n+l)  '  ®xin 
®xjn  ”  ®x  j  (n+1)  ^ 


e  .  +  e  . 

yin  yjn 

Vi(n+1) 


*  0 

®yj (n+1) 


0 


(24) 

(25) 


Owing  to  the  kinematic  symmetry  along  x-axis  (Q^in”^ 
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constraint  (24)  takes  a  spurious  form 


G  .  0  . 

yin  y]n 

and  using  (24a)  in  (25)  we  obtain  another 

constraints  on  Q  elements 

^n 


(24a) 

set  of  spurious 


®xjn  ®xj(n+l)  (25a) 

Now,  with  (ij^-jj^)  edge  under  simply  supported  restraints  (i.e., 
ej^jj^=0)  ,  (25a)  becomes 


®xj (N+1) *®xjN^®xj (N-1) 


X  j2 


0  .  =0 
x^l 


(25b) 


Obviously,  the  adjacent  row  of  elements  will  suffer  from  the 
same  spurious  constraints.  Thus  ,  no  matter  what  the  boundary 
condition  on  the  last  row  of  elements,  all  6^  dof  will  vanish. 

By  considering  elements  along  the  other  symmetry  line  (y-axis)  we 
determine  that  all  6^,  dof  also  vanish. 


It  is  then  clear  that  a  B-type  mesh  will  lock  under  simply 
supported  (and  clamped)  restraints  regardless  of  how  fine  the 
discretization  might  be.  (The  fact  that  both  6^  and  are  forced 

to  vanish  implies  zero  bending  energy.  The  deflection,  w,  will 
also  vanish  because  of  its  dependence  upon  rotational  dof  via 
constant  Kirchhoff  constraints  A„,B„*0). 

REMARK  4.1  It  must  be  kept  in  mind  that  for  a  given  plate 
thickness^  however  small,  as  the  number  of  elements  approaches  a 
large  value,  the  element  penalty  parameter  will  no  longer  be 
large.  In  this  case,  Kirchhoff  modes  will  not  be  enforced  on  the 
element  level,  and  we  should  have  a  nonlocking  solution  at  hand. 

4 . 2  C-  type  mesh.  A  typical  macro-element,  C-type 

discretization  for  a”  symmetric  plate  quadrant  is  shown  in  Fig. 
5b.  For  each  of  the  four  MIN3*  elements,  comprising  the  model, 
we  can  write  the  element  Kirchhoff  modes  (13b)  in  an  explicit 
algebraic  form  which  already  incorporates  the  necessary  boundary 
restraints. 


Simply  supported  edges  (2-3)  and  (3-4)  and  boundary  restraints 

w-=w-=w.  =  e  ,  =  6  _=e  -=e  ,  =  e  -,=  e  ..  =  0. 

—2 — 3 — 4 — X 1 — X  2— X  3— y  1— y  3 — y  4  — 

The  Kirchhoff  modes  for  the  four  MIN3*  elements  are 
element  R 


Wj^*  {a/2)Q^^, 

2"5-”i*  (6^2  -  9^5  -  Byj) 

®y2*29y5 


(26) 
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Glement  S 


2W5.  (a/2) (ey2  -  e^5  +*^5) . 


(27) 


element  P 


*x4*2''y5' 

2“5*  "'/2)(»,4  ♦  8^5  -  »y5) 


(28) 


element  Q 


"1  -  ^"5*  '\5  *  V  -  'x4>' 

®x4*2»x5' 

(a/2)e^^ 


(29) 


Solving  (26-29)  simultaneously  yields  the  following  constraints: 


1"  (3/2) 6^2. 

5"®y5'  ®x4"®y2' 


(a/4)  e 


Y2' 


e  _>2e  _ 

y2  y5 


(3n) 


Equations  (30)  clearly  show  a  nonlocking  nature  of  the  solution, 
i.e.,  the  dof  that  are  not  restrained  by  boundary  conditions 
remain  finite  (nonzero) .  Intuitively,  one  could  also  expect  that 
a  larger  mesh  of  this  type  would  not  engender  locking,  and  this, 
in  fact,  is  confirmed  by  our  numerical  solution  for  mesh  C  (refer 
to  Fig.  3  and  Table  1). 


Clamped  edges.  To  achieve  a  clamped  condition,  we  simply 

enforce  two  additional  boundary  restraints  0^2  =6^^  =0  in  (30).  In 

this  case,  all  dof  vanish,  and  the  mesh  is  fully  locked. 

In  contrast  to  the  A-  and  B-type  discretizations,  where  the 
behavior  of  a  large  mesh  follows  the  pattern  established  by  a 
single  macro-element  model,  the  C-type  mesh  (for  a  clamped  plate) 
does  not  exhibit  overall  locking  once  we  go  to  2x2  and  higher 
refinement  levels. 


To  verify  this  analytically,  without  performing  the  finite 
element  numerical  computations,  we  can  proceed  starting  with 
elements  that  form  a  clamped  corner  of  the  plate  (i.e.,  elements 
that  are  most  severely  restrained).  For  example,  let  us  consider 
a  2x2  symmetric  quadrant  discretization  depicted  in  Fig.  6c. 
Here,  edges  (k^^-k^)  and  clamped,  while  (ij^-i^)  and 

(i^-k^)  are  kinematic  symmetry  axes.  Taking  into  account  the 

boundary  restraints  imposed  on  elements  and  (i.e.. 
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where  r=kj^ , kj  ,  j  )  ,  the  Kirchhoff  modes  become 

element  S, 


"m* 


element  P, 


®xnl^" 


(31) 


w„i>  (a/4) (e^^^  -  36^^^), 


(32) 


which  result  in  or  locking  for  the  two  elements.  This 

solution  plus  the  boundary  restraints  at  nodes  and  can  be 
used  to  compute  shear  constraints  for  elements  and  which 

happened  to  be  identical  for  the  two  elements,  i.e., 

elements  Qj^  and 

"j2-  (^/2)(2Vj2  -  *xj2>' 

®xj2"'yj2' 

"j2*  '^/2'  (2«xj2  -  Vj2>  <”) 

Evidently,  there  is  no  locking  in  either  of  the  elements. 

We  proceed  further  along  the  same  lines  and  examine  the 
adjacent  elements  S2  and  P2.  Thus,  we  obtain 


element  S, 


w  +  (a/4)  (0  ,  -  0  ,  )  f 

ml  yml  xml 

6  -  -0 

xml 


element  P, 


(34) 


w 

6 


32" 

xj2" 


("/2) (20y^,  - 

^®yml  ~  ®yj2 


X  j  2 


-  "32*  *  %ml  *  'yj2  '  ’®yml'  <35) 

The  second  of  constraints  (34)  is  spurious.  In  this  case, 
however,  locking  is  only  of  a  local  nature  since  the  other  dof 
remain  nonzero. 


Similarly,  one  could  examine  the  rest  of  the  mesh  and  find 
the  second  spurious  constraint,  namely,  which  is  also 

only  local.  Thus,  the  mesh  possesses  two  spurious  constraints 
that  do  not  trigger  a  global  (mesh-wide)  locking.  The  same  kind 
of  a  local  locking  trend  is  achieved  with  a  larger  mesh,  namely. 
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the  4x4  C-d i sere t i za t i on  of  Fig,  4.  Note  that  the  resulting 
solution  has  a  substantially  greater  error  than  that  in  the 
simply  supported  case  (refer  to  Table  1).  This  is  obviously  due 
to  local  locking  of  the  clamped  plate, 

5.  ALTERNATIVE  BOUNDARY  DISCRETIZATIONS .  From  the  previous 
discussion  it  becomes  apparent  that  boundary  restraints,  element 
geometry,  and  orientation  are  the  contributing  factors  that 
affect  the  condition  of  the  shear  strains.  Shear  locking  and 
overall  solution  stiffening  were  shown  to  emanate  from  spurious 
Kirchhoff  modes.  In  the  following  two  examples  with  uniformly 
loaded  square  plates  we  demonstrate  how  simple  changes  in  the 
boundary  element  orientations  and  geometry  change  the  character 
of  a  solution. 

5.1  Simply  supported  plates.  In  Fig.  7  are  depicted  five 

meshes  for  a  symmetric  plate  quadrant  which  represent  slight 
boundary  element  modifications  of  meshes  A  and  B.  The  results 
are  summarized  in  Table  2.  Interestingly,  the  global  locking 
experienced  by  mesh  B  is  no  longer  present  in  these  alternative 
discretizations.  However,  four  of  these  meshes  exhibit  local 
locking  of  some  rotational  dof  which  are  responsible  for  the 
overall  solution  error, 

5»2  Clamped  plates.  Discretizations  G  and  H  of  Fig.  8  were 

analyzed  under  clamped  boundary  restraints.  Neither  mesh 
exhibits  global  locking  (refer  to  Table  3) ,  however,  local 
locking  takes  place  at  several  nodes. 

6 ♦  CONCLUDING  REMARKS.  In  this  paper  we  elaborated  upon  an 

a  priori,  element-by-element  algebraic  procedure  for  identifying 
such  plate  modeling  shortcomings  as  shear  locking  and/or  related 
solution  stiffening.  Closed  form  solutions  and  numerical 
computations  were  carried  out  for  several  mesh  patterns  and 
boundary  conditions  using  a  conforming,  three-node  Mindlin 
element  (MIN3^) .  The  element  tends  to  lock  occasionally  and,  in 
some  cases,  produces  an  overall  plate-stiffening  effect.  These 
pitfalls  were  shown  to  emanate  from  spurious  Kirchhoff  modes 
which  arise  from  overrestraining  of  the  element  kinematic  field 
by  boundary  conditions.  The  finite  element  solutions  were  found 
to  be  in  complete  agreement  with  the  analytic  a  priori 
predictions. 

The  enhanced  version  of  the  element  (MIN3)  uses  a  finite 
element  shear  correction  device,  which  effectively  relaxes  the 
enforcement  of  spurious  Kirchhoff  modes.  The  approach  also 
ensures  a  well-conditioned  element  stiffness  over  the  entire 

range  of  A/h^  ratios,  and  there  exists  no  shear  locking  to  hinder 
MIN3's  performance.  (Note  that  MlN3*'s  stiffness  becomes 

2 

ill-conditioned  for  very  large  values  of  A/h  ,  thus  requiring 
large  computer-word  lengths  to  avoid  conditioning  errors)  .  For 
the  purpose  of  comparison,  the  MIN3  results  for  the  square  plate 
problems  of  section  3  are  summarized  in  Tables  4  an  5 .  (Several 


483 


important  test  problems  solved  with  this  element  were  reported  in 
[29]  )  . 


Finally,  it  is  worth  commenting  on  the  general  character  of 
the  shear  (or  penalty)  constraints  discussed  herein.  The 
mathematical  form  of  these  constraints  is  virtually  identical  (or 
closely  related)  to  those  of  the  penalty  formulations  for 
incompressible  elasticity,  plasticity  and  fluid  mechanics.  The 
locking  pitfall,  often  hindering  such  penalty  methods,  is  the 
consequence  of  deficient  penalty  strain  approximations,  which 
often  are  subject  to  boundary  restraints  and  .therefore,  further 
constraining  of  the  spurious  nature.  A  close  examination  of 
penalty  modes,  in  the  manner  presented  here  for  the  Kirchhoff 
constraints,  may  be  seen  as  a  necessary  prerequisite  to 
understanding  element  behavior.  Such  an  analysis  may  also  point 
toward  new  and  more  effective  ways  of  improving  these  penalty 
methods, 
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Fig.  2.  Triangular  element  configuration. 
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FIq.  3.  Simply  supported  square  plates:  standard  symmetric-quadrant  meshes. 
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Fig.  4.  Clamped  square  plates:  standard  symmetric-quadrant  meshes. 
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a,  A  &  B-typo  macro"©l©tnont  b-  C'-typo  macro”©lotn©nt 


Fig.  5.  Macro-element  meshes. 


a.  A-type  mesh  fragment 


_ Boundary  restraints 

—  —  Kinematic  symmetry  restraints 


b.  B-type  mesh  fragment 


c-  2x2  C*type  mesh 


Fig.  6.  Fragments  of  standard  meshes. 
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FIq.  7.  Simply  supported  square  plates:  alternative  symmetric-quadrant  meshes. 
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Fig.  8.  Clamped  square  plates:  alternative  symmetric-quadrant  meshes. 
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Table  1 


simply  supported  and  clamped  square  plates  under 
uniform  load;  MIN3^  center  deflection  results 
normalized  with  respect  to  analytic  solution 
according  to  Mindlin  theory  (E»10 , 92x10^ ,  \J=0^3). 


L/h 

simply 

suppor  ted 

clamped 

mesh 

A 

B 

C 

A 

B 

c 

10 

0.981 

0.970 

0.993 

0.909 

0.947 

0,973 

10^ 

0.891 

0.728 

0.990 

0.504 

0.557 

0-935 

10^ 

0.801 

3.8x10"'* 

0.989 

1.2x10“'* 

1,5x10^4 

0-931 

10® 

0.801 

3.8x10"® 

0.989 

1.2x10"® 

1.5x10"® 

0.931 

Table  2^.  Simply  supported  square  plates  under  uniform  load; 

MIN3^  center  deflection  results  normalized  with 
respect  to  analytic  solution  according  to  Mindlin 
theory  ( E=10 , 92x10^  ,  u^0.3). 


L/h 

mesh 

D 

E 

F 

G 

H 

10 

0.971 

0.971 

0.975 

0.992 

0.988 

10^ 

0.801 

0.830 

0.831 

0.986 

0.955 

10* 

0.540 

0.579 

0.577 

0.984 

0.931 

10® 

0.540 

0.579 

0.577 

0.984 

0.931 
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Table  3,  Clamped  square  plates  under  uniform  load; 

¥ 

MIN3^  center  deflection  results  normalized  with 
respect  to  analytic  solution  according  to  Mindlin 
theory  (E=10 .92x10®  ,  u=0.3) . 


L/h  _ mesh 


G 

H 

10 

0.966 

0.968 

10^ 

0.922 

0.942 

10^ 

0.915 

0.968 

10® 

0.915 

0.968 

Table  4.  Simply  supported  square  plates  under  uniform  load; 
MIN3  center  deflection  results  normalized  with 
respect  to  analytic  solution  according  to  Mindlin 
theory  (E=10 . 92x10^  ,  \)=0,3). 


L/h 

mesh 

D 

E 

P 

G 

H 

10 

0.997 

0.997 

0,999 

1.007 

1.003 

10^ 

0.992 

0.991 

0.992 

1.006 

1.001 

10^ 

0.991 

0.991 

0.991 

1.006 

1.000 

10® 

0.991 

0.991 

0.991 

1.006 

1.000 

Table  5. 

Clamped  square 

plates  under  uniform  load; 

MIN3  ( 

oenter  deflection  results  normalized 

with 

respect  to  analytic  solution  according  to 

Hindi 

theory  (E-10. 92x10  ,  w-0.3)* 

L/h 

mesh 

G 

H 

10 

1,011 

1.014 

10^ 

1.003 

1.003 

10^ 

1.004 

1.001 

10® 

1.004 

1.001 
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ON  THE  SOLVABILITY  AND  COMPUTATIONAL  ASPECTS  OF 
A  REFINED  SHEAR  DEFORMATION  PLATE  THEORY 

J.  N.  Reddy 

Department  of  Engineering  Science  and  Mechanics 
Virginia  Polytechnic  Institute  and  State  University 
Blacksburg,  Virginia  24061  USA 

ABSTRACT 

The  existence  and  uniqueness  of  solutions  to  the  third-order  shear 
deformation  theory  developed  recently  by  the  author  are  examined,  and 
the  computational  aspects  of  the  associated  finite-element  models  are 
discussed.  The  existence  and  uniqueness  of  solutions  of  the  linear 
theory  are  proved  using  the  Lax-Milgram  theorem.  A  new  mixed  finite- 
element  model  that  uses  displacements  and  bending  moments  as  degrees  of 
freedom  and  a  displacement  finite-element  model  are  discussed.  The 
mixed  element  is  a  C®  element  while  the  displacement  element  is  a  C^ 
element. 

1.  INTRODUCTION 

The  origin  of  displacement-based  theories  is  due  to  Basset  [1].  In 
a  displacement-based  theory,  it  is  assumed  that  the  displacement 
components  of  the  plate  can  be  expanded  in  series  of  powers  of  the 
thickness  coordinate  z.  For  example,  the  displacement  component  u  in 
the  x-direction  in  the  N-th  order  theory  is  written  in  the  form 

N  f. 

u(x,y,2)  =  u°(x,y)  +  r  2%^"^(x,y)  (1) 

n*l 

where  x  and  y  are  the  Cartesian  coordinates  in  the  middle  plane  of  the 
plate,  and  u^'^^  have  the  meaning 

u^"^(x,y)  =  — L_n,  n  =  0,1.2,...  (2) 

d  z 

Basset's  work  did  not  receive  as  much  attention  as  it  deserves.  In  1949 
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NACA  technical  note,  Hildebrand,  Reissner  and  Thomas  [2]  presented  (also 
see  Hencky  [3])  a  first-order  shear  deformation  theory  [a  special  case 
of  Eq.  (1)], 

u(x,y,z)  =  u°(x,y)  +  zip^(x,y) 
v(x,y,z)  =  v°(x,y)  +  zi|i^(x,y) 

w(x,y,z)  =  w°(x,y)  (3) 

The  differential  equations  of  the  theory  were  derived  using  the 
principle  of  the  minimum  total  potential  energy.  This  leads  to  five 
equilibrium  equations  in  the  five  generalized  displacement  variables, 

u°,v°,w°,\ii  and  \i)  .  Mindlin  [4]  presented  a  dynamic  analysis  of 

A  y 

Hencky 's  theory  [3],  and  used  the  displacement  field  in  Eq.  (3)  for  the 
vibration  of  isotropic  plates.  We  shall  refer  to  the  shear  deformation 
theory  based  on  the  displacement  field  (3)  as  the  first-order  shear 
deformation  theory,  although  it  is  known  as  the  Mindlin  or  Reissner- 
Mindlin  theory  in  the  literature.  For  additional  references  on  the 
history  of  the  shear  deformation  theory,  the  reader  is  advised  to 
consult  Reference  5. 

Recently,  Levinson  [61  and  Murthy  (71  presented  third-order 
theories  in  which  vanishing  of  the  transverse  shear  stresses  on  the 
bounding  planes  of  the  plate  and  transverse  inextensibility  of  the 
normals  were  assumed.  However,  both  authors  used  the  equilibrium 
equations  of  the  first-order  theory  in  their  analysis,  and  thus  the 
higher-order  terms  of  the  displacement  field  were  accounted  only  in  the 
calculation  of  the  strains  but  not  in  the  governing  differential 
equations  or  in  the  boundary  conditions.  Recently,  the  present  author 
(see  (8,91)  derived  variationally  consistent  equations  of  motion  by 
means  of  Hamilton's  principle.  The  theory  presented  in  [81  accounts  for 
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moderately  large  deflections  but  is  limited  to  orthotropic  plates,  while 
that  in  [91  is  for  the  small -deflection  theory  of  laminated  plates. 
Finite  element  models  of  these  theories  were  presented  by  Reddy  and  his 
colleagues  [10,11].  The  present  paper  contains  a  review  of  the  theory, 
some  mathematical  results  on  the  existence  and  uniqueness  of  solutions 
of  the  linear  theory,  and  a  discussion  of  the  displacement  and  mixed 
finite  element  models. 

2.  EQUATIONS  OF  THE  HIGHER-ORDER  THEORY 

Consider  a  rectangular  laminate  with  planeform  dimensions  a  and  b 
and  thickness  h.  The  coordinate  system  is  taken  such  that  the  x-y  plane 
coincides  with  the  mid-plane  of  the  plate,  and  the  z-axis  is 
perpendicular  to  that  plane.  The  plate  is  composed  of  perfectly  bonded 
orthotropic  layers  with  the  principal  material  axes  of  each  layer 
oriented  arbitrarily  with  respect  to  the  plate  axes. 

In  order  to  obtain  a  parabolic  distribution  of  the  transverse  shear 
stresses,  a  cubic  expansion  is  used  for  the  inplane  displacements  and 
the  transverse  inextensibility  (i.e.,  the  transverse  normal  strain  is 
zero)  is  assumed.  The  resulting  displacement  is  given  by  (see  Reddy 
[8,9]): 

u^(x,y,z)  =  u(x,y)  +  z[>i)^(x,y)  -  |  (^) 2('i»j((x,y)  +  (x,y))] 

U2(x,y,z)  =  v(x,y)  +  z[ii;y(x,y)  -  ^  (^) 2(,l,^(x,y)  +  (x,y))] 

U3(x,y,z)  *  w(x,y)  ,  (4) 

where  Uj^,  U2  and  U3  are  the  displacements  in  the  x-,  y-  and  z- 
directions,  respectively.  The  displacements  of  a  point  (x,y)  on  the 
midplane  of  the  plate  are  denoted  by  u,  v  and  w;  i|)^  and  are  the 
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rotations  of  normals  to  the  midplane  about  the  y-  and  x-axes, 
respectively.  The  strains  can  be  obtained  from  Eq.  (4). 


Assuming  that  each  layer  of  the  laminate  possesses  a  plane  of 
material  symmetry  parallel  to  the  x-y  plane,  the  constitutive  equations 
for  the  k-th  layer  can  be  written  as 


where  a.  and  e^.  (i  =  1,2, 4, 5, 6)  are  the  stress  and  strain  components 
referred  to  lamina  coordinates  and  Q-  •'$  are  the  plane-stress  reduced 
elastic  constants  in  the  material  axes  of  the  k-th  lamina, 

^2  -  - 

Qll  =  Ej/(1-vj2^2i)»  ^22  ■  ^11’  ^12  "  '^12^22 

^44  "  *^23  ’  ^55  "  *^13  *  ^66  *  ^12 

and  E.,  and  G.  ^  are  the  usual  engineering  constants. 

The  equations  of  equilibrium  can  be  obtained  using  the  principle  of 
virtual  displacements.  In  analytical  form,  the  principle  can  be  stated 
as  follows  (see  [12]): 

J  (fiU  +  6W)dV  =  0  (7) 

V 

where  U  is  the  total  strain  energy  due  to  deformation,  W  is  the 
potential  of  external  loads,  and  V  is  the  volume  of  the  laminate 
and  6  denotes  the  variational  symbol.  The  following  five  equations  are 
obtained  for  the  linear  case: 
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l,x 


N.  =  0 

6,y 


Nc  +  N„  =0 

6,x  2,y 


Qi  +  Qr.  -  —  (Ri  +Ro  ]+  —  (Pi  +2P^  +P,  1  *  q 

I.X  ^a,y  ^2  ^  6,xy  2,yyJ 


3h2 


"l.x  ^  "6.y-  '>1*  h-  ^  (^l.x*P6,y)  =  » 


«6,x  *  «2,y-  V  ;;:  V  i:  ''6,x^^2,y)  '  » 


(8) 


where 

h/2  , 

(N  M  P  )  =  J  a  (1,2,2-^)d2  (i=l,2,6) 

^  ^  ^  -h/2  ^ 

h/2  . 

(Qp,Rp)  =  J  a,(1,2^)d2  (9) 

^  ^  -h/2  ^ 

h/2  p 

(Q,.R,)  =  J  oM,z^)(iz 

^  ^  -h/2 

In  the  general  case  of  arbitrary  geometry,  boundary  conditions  and 
lamination  scheme,  the  exact  analytical  solutions  to  the  set  of 
differential  equations  in  Eq.  (8)  cannot  be  found.  However,  closed-form 
solutions  exist  for  certain  'simply  supported'  rectangular  plates  with 
two  sets  of  laminate  stiffnesses,  as  described  in  [8,9,12]. 


3.  EXISTENCE  AND  UNIQUENESS  OF  SOLUTIONS 

A  formal  proof  of  the  existence  and  uniqueness  of  solutions  of  the 
equations  of  the  higher-order  theory  requires  us  to  establish  the 
positive-definiteness  of  the  bilinear  form  in  an  appropriate  function 
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(or  vector)  space.  First,  some  mathematical  preliminars  are  in  order 
(see  (131). 


Let  fl  denote  the  open  bounded  region  of  the  plate  (i.e.,  the 
midplane  of  the  plate)  and  r  be  its  boundary.  We  use  the  standard 
notation: 

L2(fi)  =  the  complete  named  (i.e.,  Hilbert)  space  of  generalized 
functions  that  are  square  integrable  in  a;  i.e.,  u€ 
L2(J2)  means 

J  |u(x,y)l^dxdy  <  « 

Q 


H^(n)  =  the  complete  normed  space  of  functions  which  along  with 

their  generalized  first  derivatives  belong  to  L-(n);  i.e., 
u  €  H^(n)  implies  ^ 


o 

/  |u(x,y)|  dxdy  <  » 

Q 


/  ||7(x,y) l^dxdy  <  * 


/  l|7(x,y) l^dxdy  <  - 
a 


H^(g)  =  the  complete  normed  space  of  functions  which  along  with 
their  generalized  derivatives  up  to  and  including  order 
2  are  in  L2(£2) 


(10a) 

We  shall  use  hJ(g)  to  denote  the  space  of  functions  u  from  H^(n)  that 

2 

vanish  on  the  boundary  r  of  a,  and  H^(n)  to  denote  the  space  of 

7 

functions  u  from  H  (n)  which,  along  with  their  first  derivatives,  vanish 
on  r.  The  norm  in  H”’(n)[H°(a)  =  L2(n)J  is  given  by 


Hull 


2 

m 


m 

E 


i,j=0 


ax^ax"^ 


2 

dxdy 


i+j  <  m 


(10b) 


Equations  (7)  and  (8)  can  be  expressed  in  terms  of  the  five 

generalized  displacements  u,  v,  w,  i|)  and  ii<  .  In  terms  of  the 

X  y 

displacements,  Eq.  (7)  has  the  form  (for  symmetric  laminates  only): 

B(a,a)  =  £(a)  (11) 
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where  a  =  (w.  a  =  (w,  i|»y),  and  B(-,-)  and  )i(-)  are  the 

bilinear  and  linear  forms  on  the  vector  space,  H 

H  =  H^ta)  X  Hj(a)  X  Hj(a)  (12a) 

The  norm  in  H  is  defined  by 

IIAII^  =  IIWII2  +  +  ll'l'yll^  (12b) 

where  ll-H  denotes  the  h'’’(o)  norm, 
m 

In  writing  Eqs.  (11)  and  (12),  it  is  assumed  that  the  inplane 
displacements,  being  uncoupled  from  w,  i|)  ,  )i»  ,  can  be  solved 

X  y 

independently  and  that  the  plate  is  subjected  to  clamped  boundary 
conditions.  The  bilinear  form  B(*,-)  and  linear  form  £(•)  are  given  by 


B(a,A)  =  ;  (^10 


®*x,j,  ’''x  I,  ^  _4_  j:  ,^*'x  3^W'  4  f  a^w,  I 


”v. 


ail*. 


aii; 


3h‘ 

4 


.  n  J  4  c  /'*'x  ,  3^W  ^  ^  c  f^'^y  ^ 

ir^"12  i)r  ^^22  ay  "  3^2  '^12^x  3^2^"  3^^2  '^22^y  ^^2^* 


ani  3<K  av  4  3^  ait  .2 

^  ^ay  ax  '^‘^66^ay  ax  ’  ^^2  '^66^3y  ax  ^  axay^' 


^’‘’x  ^  ax^^^55  ~  I2  °55^^'‘'x  ax^‘^^'‘’y  aP^V  ”  ^2  ^44^^’*'y  ^  ay^ 


/I  S'l'  ,2— 

"  (^-^)[F 


3h 


2  'ax 


ax 


3l|)  3’)b  y|  8lil  ^2 

X  ,  r  Y  ^  u  /  ^  ^  _ YL\ 

2^"  11  ax  '^12  ay  '  3^  ^U^ax 


.  3(1)  2  .  oii)  2 — 

4  u  /  y  u.  3  Wxi  4  ,  y  .  a  Wwr 
~2  ■  ~2^Jy  — )IF 

3h^  3h^  ay 


2'^  12  ax 


3ip. 


F 

*^22  ay 


^  u  a^wx  4  J.  a^wx,  4  ,^''^x 


3h 


ax 


3h 


2  ”22'ay 


ay 


3h 


2'ay  ax 


2—  ai)  ii  aip  3)i»  2 

+  2  ^  )  ( F  f ^  +  — ^) _ _  H  ( — -  +  — ^  +  2  -2-!!!^-)  1 

^  axay^^^ee^ay  ax  ^  3^  "es^ay  ax  ^  axay^^ 
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(13a) 


*^(a)  =  /  qwdxdy 

where  D^-j,  and  are  the  laminate  stiffnesses, 

h 

“  2 

Equation  (11)  is  called  the  variational  problem  associated  with  Eq. 
(8)  when  applied  to  a  clamped  laminate.  We  wish  to  show  that  the 
variational  problem  (11)  has  a  unique  solution  in  the  space  H.  Toward 
establishing  this  result,  we  use  the  well-known  Lax-Milgram  theorem  (see 
Reddy  [13]): 

Theorem  1:  Let  H  be  a  Hilbert  space,  and  let  B(-,-)  be  a  bilinear 

form  on  H  with  the  following  properties: 

(i)  |B(a,a)|  <  MiIaii^IIaII^  (continuity)  (14a) 

(ii)  B(a,a)  >  allAll^  (positive-definiteness)  (14b) 

for  all  A,  a^H.  Then  for  any  continuous  linear  functional  »,(•)  on  H, 

there  exists  a  unique  vector  a  in  H  such  that 

0 

B(A,Aq)  =  £(a) 

holds  for  every  TCH. 

A  proof  of  the  theorem  can  be  found  in  (13).  Thus,  if  we  can  show 
that  B(*,-)  and  a(*)  of  Eq.  (13)  are  continuous  and  B(*,*)  is  positive- 
definite  on  H,  then  we  have  a  unique  solution  to  Eq.  (11).  It  can  be 
shown  that  the  solution  of  the  variational  problem  is  also  the  solution 
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of  the  classical  problem  (i.e.  the  original  differential  equations), 
provided  that  the  boundary  of  the  domain  and  the  data  q  are  sufficiently 
smooth. 

We  now  set  out  to  establish  the  continuity  and  positive¬ 
definiteness  properties  of  The  continuity  of  a(-)  is  obvious. 

From  Eq.  (13a),  we  have 


=  J  |c,(|^  +  *  ♦,)  *  C2(^  +  +  (.p 


3W  .  T  \/3W 


.2-  2 


2-  .2 

3  W  3  W 


2-  2  2  2- 
-3W3W,  3W3Wa 


3W3W._  3W3W  /3  W  3  W  3_W^ 

+  C^ — ? — ?  ^4 — 2 — 2  ^5^  2~  2  2' 


3X  3X 


3y  3y 


3x  3y  3x  3y 


. .  i!w_  ifiL . .  ( A  !!i .  ifs! 

®  3X3y  3X3y  3x^  ax^ 

^  3^W  /A  +  A  ^ 

2  3,b  2  3il)  3t|/„  .2 

+  3  _ X  ^  _ ^.1  +  2c  f  ( _ -  +  — ^  ■) 

2  3y  ,  2  3x  ^  ^^lO^^ay  3x  ^^3X3y^ 

9X  9y 


■'^77  ^’*'y  ^'‘'y  ^'•'v 

.  / _ X  _ /x  3  VjL  1  +  r  _ ^  ^  +  C  ( _ -  — ^  +  - -  - 

+  +  3X  ^  3X3y*  ^11  3X  3X  ^12'3X  3y  3X  3y  ' 


+  C,,  +  C,.(^  +  T^){—  +  ^)]dxdy 


'13  ay  ay 


•14'3y 


ay  3x 


where 


^^55  j^4  *^55  *  *“2  ^44  ^4  *^44 


^  16  H  -_16u  c=^H 

<=3  =  ;;;3"ii  •  "4 -^"22  •  "5  g^4**i2  •  <=6  g^A^ee 
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“  9h^  "ii  ■  ^ '^11  ’  *^8  =  ^ ^  F,,  .  <=9  =  ^  Hi2 -j;;?  Fi2 


"1“  “  9^  ■  i? '^66  ■  ‘11  =  ’' 


=  D  -  ^  P  1  16  u 

“  11  3^2  11 


'^12  '  “l2  -  3^2  '^12  ■"  1l2  •  ‘^13  ’  “22  ’  3^2  ^22  *  ^  "22 


c  -  D  -  ^  P  +  1®  LI 

'14  “66  3^2  “66  " “66 


The  continuity  of  B(',-)  can  be  proved  by  using  Holder's  inequality 
for  sums  of  integrals  [13], 


z  /  u.v.dxdy  <  z  (/  |u.|^dxdy]^/^[J|v.|^dxdy]^/^ 
1=1  ^  i=l  a  '  ^ 


n 

<  {  E  /  |u.  |^dxdy)^/^(  z  J  [vj^dxdy)^/^ 
i=l  a  ^  i=l  n  ^ 


We  have 


|B(a,a)|  <  M(iiwii2  +  +  llili^ll^)  •  (llwll^  + 


H"""H 


where  M  =  max  ( |c^| , [c^l .. . . , |Cj^| ) . 

Next  consider  the  bilinear  form  B{a,a): 


B(a,a)  = 


^  ^  <^2(|j  *  V)^  -  <=3(4^ 

3X 


2  2  2  2 
+  r  l^\'^  +  ?r  3  W  3  W  .  /3  W  x2 

ax2  «  '"8  ay8  ^  ^ 
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(18) 


gy2  ax  '  ^‘'10' ay  ax  '  axay 


o  3'ti„  3>l»^  a'l'^-o 


'12  ax  ay  13'ay 


3''Py  ? 


Next  we  use  Korn's  inequality  (see  [13]):  For  a  =  (u,v)C 

H^(j2)  x  H^(n)  there  exists  a  constant  c(q)  (depends  only  on  si)  such  that 
0  0 

(IIUII^  +  iv|2)l/2  <  c(a){J  12(|i)^  +  2(|j)^  +  J  (If  +  I;)^l8xily  (19) 
Then  we  have 

B(a,A)  >  allMI^ 

where  a  >  0  is  a  constant  that  depends  on  si  and  the  constants  Cj^, 
C2,-..»Cj^4,  but  is  independent  of 

Since  B(*,-)  is  continuous  and  positive-definite  on  H,  the 
variational  problem  (11)  has  a  unique  solution  a  =  (w,ib^,Vy)  in  H. 
Although  we  have  given  the  results  here  for  clamped  plate,  the  results 
can  be  extended  to  simply  supported  plates  and  plates  with  mixed 
boundary  conditions. 

4.  FINITE  ELEMENT  MODELS 

4.1  Displacement  Finite  Element  Model 

It  is  informative  to  note  that  the  equations  of  the  present  higher- 
order  theory  has  terms  from  both  the  classical  and  first-order  shear 
deformation  theory.  Although  the  number  of  generalized  displacements 
are  the  same  in  the  higher-order  and  first-order  theories,  the  higher- 
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order  theory  contains  terms  involving  the  second-order  derivatives  of 

the  transverse  deflection  in  the  potential  energy  expression.  Hence, 

like  in  the  classical  plate  theory,  Hermite  interpolation  of  the 

transverse  deflection  is  required,  while  Lagrange  interpolation  of 

il)  and  li)  is  admissible  for  the  displacement  finite  element  model. 

A  y 

The  finite-element  model  is  of  the  form 


|K=I{4®}  =  {F=} 


(20) 


where  [K®]  is  the  element  stiffness  matrix,  {a®}  is  the  vector  of  the 

nodal  values  of  the  generalized  displacements,  and  {F^}  is  the  force 

vector  that  contains  contributions  from  both  applied  loads  and 

contributions  from  the  boundary  of  the  element  (i.e.,  internal 

forces).  In  the  present  study,  the  isoparametric  rectangular  element  is 

used  for  ij)  and  4)  and  the  four-node  Hermite  cubic  element  is  ued  for  w. 
X  y 

The  element  stiffness  matrix  [K®]  is  of  the  form 

[K 

[K®]  =  [K"^]'  [K^^l  [K^^]  1  (21a) 


where 


10 


3<D.  3<t>.  . 

f  f  r  - 1.  _ i  +  r  _ 1- 

•^j^e  1  ax  ax  ^2  ay 


‘h 

[K^^l 

[K^^l 

[K^^l 

13jT 

[k23]T 

2, 

2 

34)  . 

■  — + 

a  4). 

C-  - 

a  4)  . 

- tA  + 

ay 

3  2 

ax 

3X^ 

^  3  $i  a 

'4  2  2 

ay  ay 


+  c.( 


2  2  2  2  2  2 

a  41.  a  4i.  3  4).  3  4).  a  4i.  a  4). 

^  - +  4c.  TTjri^Jdxdy 


5^  2  2 

ax  ay 


2  2 
ay  ax 


,12 


2 

a4>.  a  4i.  aiiii  - 

‘=1 =7^7-17- *'=9 ;7 

2Cio  ^ 


6  3X3y  3X3y^ 

s. 

ax 


3  3\l)j 
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2  2  2 

3()),  9  (j),  1  3  4>»  B'li-i  3  S'!*! 

>'2  IT  *j  ^  <=8  ^  TT "  ^9  ;;r  aT  *  m? 


3<Pj  Bipj 


9(l»j  9il)j 


=  r  ICi(I)t<I>i  +  Cii - +  Ci, - ^Idxdy 

j  g  I  j^vjvj  ^2^  gj^  2^4  ay  3y 


9i|jy9i|j>  9i[ij3i|)^ 

^^12  iF  *^14  aF 


3ipj  3<l>j 


3i|)j  9ipj 


^  J'„e  ^‘^2’‘’I'*'J  ^^13  3y  ay  ^^14  3X  3x 


(21b) 


4.2  Mixed  Model 

To  relax  the  continuity  requirements  placed  in  the  displacement 

model  described  above,  here  we  consider  a  mixed  formulation  of  the 

problem.  In  a  mixed  model,  independent  approximations  of  the 

displacements  and  stress  resultants  are  used.  A  mixed  variational 

formulation  of  these  equations  can  be  obtained  by  treating  u,  v, 

w,  M2,  M2,  Mg,  P2 ,  P2  and  Pg  as  the  primary  variables.  The 

governing  equations  for  u,  v,  w,  ip  and  41  are  given  in  Eq.  (8).  The 

x  y 

equations  for  the  other  six  variables  (M2,  M2,  Mg,  P2,  P2  and  Pg)  are 
provided  by  the  laminate  constitute  equations; 

{N}  =  [A](e°}  +  [Bl{ic°}  +  (E]{k;^} 

{Ml  -  [Bl'e'’}  +  I0|{<°}  +  IFKk^I  (22a) 

(Pi  =  IE|(e“}  ^  IF|{k“1  .  1H|{«2} 


where  A^j,  etc.,  are  the  plate  stiffnesses,  defined  by 


(22b) 
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h/2 


=  J  Q.  .(^,z,z^.z^2'^,z®)d2  (i,j=l,2,6) 
-h/2 


(23a) 


=  r 

j 


h/2 


Q.  .(l,2^z^)dz  (i,j=4,5) 


(23b) 


-h/2 

The  mixed  model  is  of  the  form 

[K®]{a®}  =  {f®}  (24) 

where  [K®],  {a®}  and  {F®}  are  the  generalized  element  stiffness  matrix, 
element  displacement  vector,  and  element  force  vector,  respectively. 

The  vector  {a®}  denotes  the  set  of  nodal  values, 

{a  }  =  {u ,U2»  •  •  •  jU^ , V , V2 1  •  •  •  j V^ , Wj^ , W2 ,  •  • . , , 

.2  ,n  ,1  ,2  ,n  ul  m2  mI^ 

XX  ^  ^  ^ 

M2,M2,...,M2,Mg,Mg,...,Mg,pJ,Pp...,pJ, 


ftl  p2  pO  pi  p2  p^l 

K2,i'2,...,K2,rg,rg,...,rg|^g^ 


(25) 


The  displacement  model  results  in  eight  degrees  of  freedom  (u,  v, 

2 

lx*  ly’  1^*  ’‘’x*  '•'y^  P®'"  ^°^®  ^^®  "'°^®^  results  in 

eleven  degrees  of  freedom  (u,  v,  w,  M^,  M2,  Mg,  P^,  P2,  Pg)  per 

node.  The  displacement  model  with  linear  approximation  of  u,  v,  and 
il»y  and  Hermite  cubic  interpolation  of  w  results  in  stiffness  matrix  of 
order  32  x  32.  For  linear  interpolation  of  all  variables,  the  mixed 
element  results  in  stiffness  matrix  of  order  44  x  44.  Thus,  the  mixed 
element  is  computationally  expensive  compared  to  the  displacement  model, 
although  increased  accuracy  of  the  bending  moments  is  expected  in  the 
mixed  model. 
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1.  INTRODUCTION.  A  system  of  conservation  laws  in  one  space 
dimension 

Ut  +  F(U)^=0  -»<x<<»,  t>0  (1.1) 

is  strictly  hyperbolic  at  U  if  dF(U)  has  distinct  real  eigen¬ 
values.  Many  hyperbolic  systems  of  physical  interest  fail  to  be 
strictly  hyperbolic  at  every  point.  In  this  paper,  we  summarize 
recent  work  on  2x2  hyperbolic  systems  that  are  strictly  hyper¬ 
bolic  except  at  a  single  point  called  an  umbilic  point. 

Equations  with  umbilic  points  arise  as  models  of  three  phase 
flow  in  a  porous  medium.  In  this  context,  the  Riemann  initial 
value  problem  assumes  especial  importance,  since  it  is  central  to 
numerical  front  tracking  methods  based  on  Glimm's  scheme.  The 
Riemann  problem  for  (1.1)  has  jump  initial  data 

if  X  <  0 

U(x,0)  =  ^  ^  .  (1.2) 

•■U^  if  X  >  0 
K 

Solutions  of  (1.1),  (1.2)  involve  combinations  of  centered  shock 
and  rarefaction  waves.  Each  wave  in  a  specific  characteristic 
family  may  be  characterized  by  the  state  U_  on  the  left  of  the 

wave  in  (x,t) -space  and  the  wave  strength.  For  fixed  U  ,  the 

strength  of  the  wave  parameterizes  a  one-parameter  family  of  states 
U^  on  the  right  of  the  wave.  These  one-parameter  families  (one 

for  each  characteristic  family)  are  called  wave  curves.  The 
umbilic  point  has  a  profound  effect  on  the  structure  of  wave 
curves.  An  overall  goal  of  this  research  is  to  classify  the  vari¬ 
ous  effects  produced  by  the  presence  of  such  a  point.  The  classi¬ 
fication  of  2x2  systems  in  [3]  identifies  four  different 
regimes  that  we  describe  in  12.  In  53  we  illustrate  the  effect  of 
the  umbilic  point  by  describing  the  solution  of  a  prototype  Riemann 
problem.  Of  particular  note  are  two  new  types  of  shock  wave  violat 
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ing  the  Lax  entropy  condition.  These  overcompressive  and  under- 
compressive  shocks  are  associated  with  the  presence  of  the  umbilic 
point,  which  confuses  the  labelling  of  the  characteristic  speeds. 
Details  of  the  results  and  properties  described  here,  together  with 
new  techniques  for  studying  Riemann  problems,  are  discussed  fully 
in  [3,4] . 

2.  CLASSIFICATION  OF  EQUATIONS  WITH  UMBILIC  POINTS.  To  moti¬ 
vate  the  analysis,  we  begin  by  describing  equations  modelling  three 
phase  flow  in  a  porous  medium.  Let  u,v,w  denote  volume  fractions 
of  the  phases,  with  corresponding  relative  permeabilities  f,g,h. 

We  assume  the  flow  is  one-dimensional,  is  not  subject  to  external 
forces,  and  that  the  pressure  in  each  of  the  phases  is  the  same. 

We  make  the  constitutive,  or  modelling,  assumption  that  f,g,h 
depend  only  on  u,v,w  respectively :  f  =  f(u),  etc.  As  usual,  the 
conservation  of  momentum  is  approximated  by  Darcy's  law,  which  for 
1-d  flows  enables  us  to  express  the  velocities  in  terms  of  the 
volume  fractions.  This  puts  the  conservation  of  mass  in  the 
following  form: 


u 


t 


+ 


^  D  'x 


,g(v) 

‘x 


0 

0, 


(2.1) 


with  D  =  f(u)  +  g(v)  +  h{w)  and  u  +  v  +  w  =  1.  The  physical 
range  for  u,v  and  w=  1  -u-v  is  [0,1].  Note  that  the  corre¬ 
sponding  equation  for  w  is  redundant  and  thus  excluded  from 
(2.1),  and  that  f,g,h  are  taken  to  include  the  corresponding  vis¬ 
cosities  . 


Here  are  properties  of  f,g,h  that  reflect  experimental  re¬ 
sults  for  2-phase  flow: 

(A)  f(0)  =  f‘(0)  =  0,  f"(u)  >0,  0  <  u  S  1, 

and  similarly  for  g,h. 

Let  us  return  to  terminology  for  general  2x2  systems  in 
order  to  state  a  result  for  system  (2.1): 

Ut  +  F(U)jj  =  0  ,  (2.2) 


where  U  =  U(x,t)  e|R^,  F:  IR  -»IR^.  System  (2.2)  is  hyperbolic  if 
dF(U)  has  real  eigenvalues  (characteristic  speeds),  which  we  label 
A^{U)  s  A2(U).  If  A^(U)  =  A2(U),  we  call  U  an  umbilic  point. 

An  umbilic  point  U*  is  essential  if  the  set  of  matrices 

* 

y  =  (dF(U) :  U  near  U  )  is  a  smooth  2-dimensional  manifold,  in 
«  * 
which  U  is  the  only  umbilic  point,  and  dP(U  )  is  diagonal. 

The  term  "essential"  reflects  the  property  that  such  umbilic  points 
cannot  be  removed  by  perturbations  of  F. 
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Theorem  2 » 1 .  Under  assumption  (A),  system  (2.1)  has  a  unique 
essential  umbilic  point  in  the  triangle  0  <  u,v,l  -  u  -  v  <  1. 

We  next  turn  to  the  structure  of  wave  curves  for  system  (2.2) 

* 

near  an  essential  umbilic  point  U  .  Consider  the  Taylor  series 
for  F(U)  about  U 

F(U)  =  P(U*)  +  dF(U*)(U  -  U*)  +  Q(U  -  U*)  +  h.o.t., 

12  ♦ 

where  Q  =  -j  d  F(U  )  and  h.o.t.  indicates  the  remainder  term. 

*  * 

Since  F(U  )  is  constant  and  dF(U  )  is- a  multiple  of  the 
Identity  (and  may  be  removed  by  changing  to  a  moving  system  of 

* 

coordinates),  the  first  term  to  affect  wave  curves  is  Q(U  -  U  ). 
Assuming  Q  is  nondegenerate  in  a  sense  specified  in  [3],  the 
higher  order  terms  do  not  change  the  qualitative  properties  of  wave 

curves  near  the  umbilic  point.  We  shall  ignore  the  higher  order 

* 

terms  in  what  follows.  Taking  u  =0  without  loss  of  generality, 
we  are  left  with  the  system 

Ut  +  Q(U)^  =  0  -«p<x<«*,  t>0  (2.3) 

Now  U  =  0  is  automatically  an  umbilic  point  for  (2.3),  and  is 
essential  if  dQ(U)  has  distinct  (real)  eigenvalues  for  every 
U  0. 

Now  linear  changes  of  dependent  variable  U  in  (2.3)  do  not 
affect  the  characteristic  speeds,  or  other  features  such  as  the 
admissible  shock  waves  and  the  rarefaction  waves.  Accordingly,  it 
is  appropriate  in  a  classification  of  equations  (2.3)  to  call  two 
equations  equivalent  if  a  linear  change  of  variable  converts  one  to 

2  2 

the  other.  That  is,  quadratic  maps  ^  (k=l,2)  are 

equivalent  if  there  exists  a  (constant)  matrix  S  such  that 
Qj(SU)  =  SQ2(U)  for  all  U  «  IR^ .  If  Q(U)  =  dC(U)  for  a  homogen- 

3 

eous  cubic  scalar  C:  R  -»  R,  then  (2.3)  is  symmetric,  and  automa¬ 
tically  hyperbolic.  The  following  result  says  that  up  to  equiva¬ 
lence,  all  equations  (2.3)  with  an  essential  umbilic  point  are 
symmetric . 

Theorem  2.2.  If  U  =  0  is  an  essential  umbilic  point  for  (2.3), 
then  Q  is  equivalent  to  dC,  where 

C(u,v)  =  au^/3  +  bu^v  +  uv^  (2.4) 


and  a  ^  1  +  b 
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Note  that  (2.4)  Is  not  the  most  general  cubic  scalar.  We  have 

Q 

rotated  coordinates  to  eliminate  the  v  term  and  scaled  to  let 

2 

the  coefficient  of  uv  be  unity.  Both  of  these  transformations 
preserve  equivalence.  Theorem  2.2  reduces  the  study  of  equations 
(2.3)  to  an  investigation  of  a  2-parameter  family  of  equations: 

"t  *  ‘^C(U)^  =  0  (2.5) 

Recall  that  shock  waves  for  a  2x2  system  are  compressive 
if  they  satisfy  the  Lax  entropy  condition  [1],  which  requires 
characteristics  of  one  family  to  enter  the  shock  from  both  sides 
and  characteristics  of  the  other  family  to  pass  through  the  shock. 
There  are  also  what  we  call  overcompresslve  shocks  for  which  both 
families  of  characteristics  enter  the  shock,  and  noncompressive 
shocks  for  which  both  families  of  characteristics  pass  through  the 

shock.  If  we  fix  U  ,  all  possible  shock  waves  with  U  on  the 

^  L 

left  are  identified  by  the  set  of  U  lying  on  a  shock  wave  curve. 


Figure  1.  Regions  in  the  ( a, b) -plane. 
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These  U_  satisfy  the  Rankine-Hugonlot  condition  and  correspond  to 
R 

compressive  and  overcompresslve  shocks,  (Certain  noncompress ive 
shocks  also  need  to  be  selected  in  general .  This  is  discussed 
briefly  in  %3.)  We  also  have  rarefaction  curves  which  give  the 
values  of  U  through  a  centered  rarefaction  wave.  These  values 
lie  on  an  Integral  curve  of  one  of  the  two  right  eigenvectors  of 
dF(U) .  In  Figure  2,  we  show  these  wave  curves  for  (2.5)  with 

U  =  0  on  the  left  of  the  wave.  There  are  four  regions  in  which 
L  , 

the  structure  of  the  wave  curves  changes;  these  are  shown  in  Fagure 

1 . 


S^:  slow  shocks 

R  :  slow  rarefactions 

1 


S^:  overcompresslve  shocks 

R  :  fast  rarefactions 
2 


Figure  2.  Wave  curves  originating  at  the  \imbllic  point. 
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To  understand  the  over compressive  shocks,  consider  the  Rlemann 

problem  (1.2),  (2.5)  in  region  2,  when  =  0  and  U.  is  close 

L  R 

to  the  curve  representing  overcompresslve  shocks  in  Figure  2. 

The  solution  involves  two  compressive  shock  waves  of  nearly  equal 
speed  separated  by  a  state  on  the  curve  .  This  is  illu¬ 

strated  in  Figure  3.  Thus,  an  overcompresslve  shock  wave  is  a 
superposition  of  fast  and  slow  compressive  shock  waves  travelling 
at  the  same  speed. 


Figure  3.  Rarefaction  curves. 
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In  Figure  4,  we  show  the  patterns  of  rarefaction  curves  for 
(2.5)  in  the  four  cases.  Note  that  the  arrows  indicate  the  direc¬ 
tion  of  increasing  characteristic  speed. 


Figure  4.  Solution  of  Riemann  problem. 

^R  ®0- 

The  following  result  shows  how  the  3-phase  flow  model  (2.1)  fits 

into  the  classification.  By  Theorem  2.1,  system  (2.1)  has  an 

*  ♦ 

umblllc  point  (u  ,v  ).  Let  Q  denote  the  quadratic  terms  of  the 

♦  * 

Taylor  expansion  of  the  nonlinearity  of  (2.1)  about  (u  ,v  ). 

Theorem  2.3.  Under  assumption  (A) ,  the  quadratic  mapping  Q  for 
system  (2.1)  is  equivalent  to  dC,  for  C  given  by  (2.4)  with 

a  <  1  +  b^ . 


*  « 

In  other  words,  system  (2.1)  has  properties  near  (u  ,v  )  corre 
spending  only  to  quadratic  nonlinearities  in  regions  1  and  2  of 
Figure  1 . 


3.  SOLUTION  OF  A  RIEMANN  PROBLEM.  In  this  section,  we  ex¬ 
plain  the  solution  of  the  Riemann  problem 

-  (z^)^  =  0  (3.1) 


z(x,0) 


X  <  0 
X  >  0  ' 


(3.2) 


where  z  =  u  +  iv.  Note  that  equation  (3.1)  has  the  symmetry  pro¬ 
perty  that  e^’^^^^z  and  z  are  solutions  whenever  z  is  a  solu- 
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tlon.  For  each  Zj^,  z^,  there  Is  a  combination  of  centered  shock 
and  rarefaction  waves  that  Join  z  to  z_.  For  fixed  z  ,  each 

li  R  L 

combination  of  successively  faster  waves  determines  a  value  of  z  . 

R  ^ 

and  as  the  strengths  of  the  waves  change,  so  does  the  value  of  z^^. 

In  Figure  5,  each  possible  combination  of  shock  waves  (S)  and 
rarefaction  waves  (R)  Is  indicated  and  associated  with  a  region. 


Figure  5.  Solution  of  the  Rlemann  problem  (3.1),  (3.2) 
for  a  typical  z  . 

Ii 

For  example,  if  z^^  lies  in  a  region  labelled  (RS)S,  then  the 

solution  of  the  Rlemann  problem  Involes  a  slow  rarefaction-shock, 
and  a  fast  shock  wave.  The  label  Z  Indicates  an  under compressive 
shock.  These  are  noncompress Ive  shocks  that  have  viscous  profiles 
associated  with  the  Burgers'  system 

-2 

z.  -  (z  )  =  e  z  ,  (3.3) 

t  X  XX  I  7 
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Now  as  z_  changes,  the  z_  regions  get  distorted.  The 
L  R 

regions  have  the  shapes  indicated  in  Figure  5  only  for  2_  between 
the  lines  =  {z  >  0}  and  =  {z  =  :  a  >  0},  which  are 

lines  of  symmetry  for  equation  (3.1).  As  crosses  Mg  or  M^, 

several  regions  coalesce  and  collapse,  and  then  reform  in  a  differ¬ 
ent  arrangement  that  is  easily  obtained  by  symmetry. 

One  interesting  feature  of  Figure  5  emphasizes  the  role  of  the 
umbilic  point  z  «  0.  The  curve  PQ  separates  two  different  con¬ 
structions  of  solutions  of  the  Rlemann  problem.  If  z  lies  above 

K 

PQ,  then  the  standard  construction  of  Liu,  for  strictly  hyperbolic 
systems,  applies.  This  says  that  there  is  a  point  z^  on  the 

curve  ABC  (representing  slow  waves)  such  that  z  lies  on  the 

K 

curve  through  z  representing  fast  waves.  Below  PQ,  the  con- 

in 

struction  is  similar  except  that  the  slower  waves  (including  the 

under compressive  shocks)  are  represented  by  the  curve  WXYZ,  which 

does  not  Include  z  .  The  "coordinate  system"  that  was  used  above 

L 

o 

PQ  is,  roughly  speaking,  rotated  through  90  below  PQ. 

It  should  be  emphasized  that  the  solution  of  (3.1),  (3.2)  has 
relied  upon  the  symmetry  property  of  (3.3)  to  distinguish  the  under 
compressive  shocks.  Without  the  symmetry,  or  if  the  diffusion 
matrix  fails  to  be  a  multiple  of  the  identity,  then  characterizing 
shocks  possessing  viscous  profiles  is  significantly  more  difficult. 
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ABSTRACT.  The  basic  equation  to  be  discussed  is: 

A[A(x)|f  t  0(x)z].-||  -0,  x>0,  t>0,  (*) 

A(x)  =  D(x)  »  apx^  +  6x, 

with  parameters  a>0,X<l,  p<l,8€IR.  The  standard  heat  equation 
arises  for  X  =  -1,  p  =  B  =  0.  Crucial* for  the  construction  of 

solutions  of  (*)  is  a  function  v*(x,t;y)  which  is  derived  from 
the  fundamental  solution  v(x,t;y,s).  v*{x,t;y)  is  the  delta 
function  initial  condition  solution  of  (*)  with  6  applied 
at  X  =  y  =  0,  describing  distribution  processes  from  a  com¬ 
pletely  concentrated  initial  state.  v*(x,t;y)  is  the  kernel 

for  Poisson-Lebesgue  and  Poisson-Stielt jes  transform  solutions 
of  (*).  It  also  forms  the  basis  for  the  construction  of  bi-: 
orthogonal  solution  sequences  of  (*).  Furthermore ,■ it  is  the 
essential  ingredient  for  the  definition  of  a  generalized 
Jacobi  Theta  function: 


0(x,t)  =  v*(x,t,0)  *  2jj  V(x,t,y^). 

For  fixed  t  > 0,  the  function  v*(x,t;0)  may  be  interpreted  as 
a  probability  density  function.  It  contains  as  special  cases 
a  number  of  well  known  functions. 

I.  ORIGIN  OF  THE  EQUATION 

The  equation 

^  [A(x)  If  +  0(x)z]  -  II  =  0  ,  z  =  2(x,t),  (1.1) 

was  used  as  a  (linear)  one-dimensional  autonomous  model  for 
various  diffusion  type  processes  in  problems  of  heat  conduction, 
diffusion  of  atoms  into  semiconductor  materials,  and  decay  of 
molecular  systems  from  excited  states. 
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For  physical  reasons,  the  requirement  arose  that  equation  (1.1) 
be  of  such  a  nature  that  it  admits  a  conservative  similarity 
solution  in  the  open  first  quadrant  of  the  (x,t)  plane.  A 
solution  of  this  kind  is,  by  definition,  of  the  general  form 

z^(x,t)  =  b”^(t)f(a,  K  =  xb"^(t),  (1.2) 

with  b(0)=0,  b(t)>  0  and  increasing  as  t  t  and  f(  ^  )  such  that 

/  z^(x, t )dx  =  1  . 

0 

Under  appropriate  circumstances,  the  function  (1.2)  will  be  the 
delta  function  initial  condition  solution  of  equation  (1,1)  with 
the  delta  function  applied  at  the  origin  (x,t)=(0,0).  It  will 
then  describe  a  distribution  process  in  space  and  time  from  a 
completely  concentrated  initial  state  at  (0,0). 


Substitution  of  (1.2)  into  (1.1)  leads  to  a  condition  on  the  co¬ 
efficient  functions  A(x)  and  D(x)  of  tl\e  form 


F(A, 


=  0 


(1.3) 


and  to  a  first  order  equation  for  b(t)  and  a  second  order  equation 
for  f(^).  An  additional  requirement  is  needed  to  determine  A,D,b, 
and  f  uniquely.  It  could  be  derived  from  a  physical  principle 
associated  with  the  process  to  be  modeled  by  equation  (1.1)  or 
from  an  assumption  about  the  physical  nature  of  the  underlying 
diffusion  process.  It  was  decided  that  the  diffusion  coefficient 
A(x)  should  satisfy  a  power  law, 

A(x)=  ax  a>0. 


Relation  (1.3)  then  leads  to  the  drift  coefficient 
D(x)  =  apx^  +  6x,  6€1R  » 

and  the  functions  b(t)  and  f(^)  appearing  in  (1.2)  take  the 
specific  forms 


(  ,3=0. 

b(t)  =  I 

I  [a(l-X)3-'>  (1  -  exp  -  (1-X)3t)]^^'^^'  ,  3  0  ,  (1.4) 


f(5)  =  xb-^d),  q  =  (X-p)(l-X)-'>  .  (1.5) 
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It  is  clear  that  the  parameter  value  X  =  1  must  be  avoided  and 
that,  therefore,  X  > 1  or  X  <  1.  It  turns  out  that  X  <  1  leads  to 
physically  interesting  situations.  Because  of  the  conserva¬ 
tiveness  requirement,  it  is  necessary  to  restrict  the 
parameter  p  to  values  p  < 1.  This  then  makes  the  compound  parameter 
q  appearing  in  (1.5)  greater  than  -1. 

Special  cases  of  the  now  specified  equation  (1.1)  are  the  heat 
equation  [1]  for  X =  -1,  p  =  $  =  0.  and  the  Feller  equation  [2,3] 

for  X  =  0 . 

II.  A  FUNDAMENTAL  SOLUTION 
The  equation 

X[A(x)|f*DU)z]-  11=0  .  (2.U) 

A(x)  =  ,  D(x)  =  apx^  +  3x,  a  >  0,  *X  <  1,  p  <  (2.1b) 

% 

has  a  fundamental  solution  v(x,t;y,s),  x>0  ,t>0,  y>0,  s>0 
[4).  This  is  to  say  that  v(x,t;y,s)  for  fixed  y  and  s  is  a 
solution  of  (2.1)  as  a  function  of  x>0,  t>0,  and  for  x  and  t 
fixed  it  is  a  solution  of  the  corresponding  adjoint  equation 
for  y  >  0 ,  s  >  0 . 

A  special  solution  of  equation  (2.1)  can  be  obtained  from  the 
fundamental  solution 

v*(x,t;y)  =  (l-X)y(l'^^)  v(x,t;y,0) 

=  (e-3tTi)2(p-A) 

^  ,  (2.2) 

xb“^,  n  =  y  b”^,  q  =  (X-  p)  (1-X)  b  =  b(t)  as  given  by 

(1.4).  I  =  modified  Bessel  function  of  the  first  kind  (of 
q 

imaginary  argument) 

III.  OTHER  SPECIAL  SOLUTIONS 

The  function  v*(x,t;y)  defined  by  (2.2)  plays  a  crucial  role  in 
the  construction  of  other  special  solutions  of  the  equation  (2.1). 
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Boundary  condition  solutions  of  (2.1)  for  prescribed  behavior 
along  the  t-axis  have  been  established  in  [5].  They  will  not  be 
discussed  further  in  this  overview. 

Initial  condition  solutions  can  be  defined  as  Poisson-Lebesgue 
transforms 


z(x, t) 


v*(x, t;y)g(y)dy. 


(3.1) 


provided  that  g(x)  is  Lebesgue  summable  over  every  compact 
interval  of  0  <  x  <  «>.  This  class  of  solutions  has  the  property 
that 


z(x,0+)  =  g(x)  almost  everywhere, 

which  justifies  their  designation  as  initial  condition  solutions. 

Another  important  class  of  solutions  can  be  defined  as  Poisson- 
Stieltjes  transforms 


z(x,t)  = J  v*(x, t;y)dh(y) , 

0 

provided  that  h(x)  is  of  bounded  variation  on  every  compact 
interval  of  0  <x<  «■.  A  special  case  arises  if  one  takes  for  h(x) 
the  Heaviside  unit  step  function  at  x  =  y  >0.  This  situation 
identifies  v*(x,t;y)  as  the  delta  function  initial  condition 
solution  of  equation  (2.1)  with  the  delta  function  applied  at 
x=y>0,  t=0.  Letting  y  t  0  one  obtains  the  particular  solution 

v*(x,t;0+)  =  z^(x,t)  =  exp  .  (3.2) 


Thus,  the  similarity  solution  (1.2)  with  b  and  f  specified  by 
(1.4)  and  (1.5),  respectively,  is  being  recovered  and  now  identi¬ 
fied  as  the  delta  function  initial  condition  solution  with  the 
delta  function  applied  at  the  origin. 

For  details  on  this  section,  the  reader  is  referred  to  [4]. 
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IV,  BIORTHOGONAL  SEQUENCES  OF  SOLUTIONS 


Using  the  Poisson-Lebesgue  transform  (3.1)  with  g(x)  =  ^ 

=  exp  [  (n(  l-X)-p)  log  x  ]  with  log  x  real  for  x>0  ,  one  can 
define  the  sequence  of  initial  condition  solutions 


CO 

v^(x,t)  =  J*  v*(x,  t  ;y)y’^^  ^  ^^~^dy  (n=0 , 1 , 2 ,  .  .  .  ) . 

0 

Furthermore,  expanding  v* (x, t ; u^ ^  S  into  a  power  series 
about  u  =  0,  one  obtains 

v*(x,t;u^^"^^  ^  )  =  I  w^(x,t)u" 


which  converges  everywhere  in  the  (finite)  u-plane  and,  thus,  is 
an  entire  function  of  the  complex  variable  u.  It  can  be  verified 
directly  that  each  of  the  coefficient  functions  w^(x,t)  is  a 

solution  of  the  equation  (2.1)  for  x>  0,  t>  0,  although  none  of 
them  is  an  initial  condition  solution  in  the  sense  of  (3.1). 

The  sequences  {v^(x,t)}  and  {w^(x,t)}  have  the  property  that 


00 

j  x,-t)dx  = 

0 


0  if  m  n, 

n!  exp  -[1+n(1-X)]  6t  if  m  =  n. 


This  fact  allows  the  expansion  of  certain  types  of  solutions 
z(x,t)  of  equation  (2.1)  into  series  in  terms  of  the  functions 

v^(x,t)  for  example. 

A  typical  theorem  says  roughly  that  if  a  solution  z(x,t)  of  (2.1) 

GO 

is  of  a  certain  structure,  then  z(x,t)  =  ^  c  v  (x,t)  and  vice 

n  n 

n=o 

versa.  Details  on  this  subject  may  be  found  in  [6]. 
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V.  A  GENERALIZED  JACOBI  THETA  FUNCTION 


This  topic  has  been  developed  in  [7]  and  the  reader  is  referred 
to  this  paper  for  details.  In  terms  of  the  special  solution  (2.2) 
of  the  equation  (2.1)  the  function 

00 

e(x,t)  =  V*(x,t;0)  +  2  J  V*(x,t;y^)  ,x€(l,  te  I  ,  (5.1) 

n=l 

is  being  defined  for  a  suitable  sequence  {y^^}  /  0  <  yj^<  yn " 

as  n  t  «»  ,  as  a  function  of  the  complex  variables  x  and  t. 

A  special  case  of  (5.1)  arises  for  X“-l,  pssg=0  (heat 
equation),  and  y^^  nir, 

e(x,t)  =  ^  (x,e~'^  ®^),xeo:,tei;,  Ret>o,  (5.2) 

a 

3  being  one  of  the  Jacobi  Theta  functions  [s].  Chap.  XXI. 

The  analytic  properties  of  the  function  e(x,t)  which,  because  of 
the  relation  (5.2),  is  being  designated  a  generalized  Jhcobi  Theta 
function,  have  been  discussed  in  [7].  Typical  results  are  as 
follows! 

Theorem.  For  fixed  tC  £  such  that  Re  (-t)  <  0  and  for 


suitable  }  »  6(x,  t)  is  a  holomorphic  function  of 

X  €.  {x  €  C  I  X  0,|arg  x|  <  it  }. 


Termwise,  dif f erentation  is  justified. 

Remarks . 

a.  The  set  of  points  t  such  that  Re  (-t)  <  0  is  not 

empty.  It  contains  all  points  t  with  Re  t  >0,  Im  t  =  0. 

b.  In  special  cases,  the  holomorphic  nature  of  0(x,t) 
extends  to  all  of  (t  ,  for  example  in  the  case  specified  by  (5.2). 

Theorem.  For  fixed  x6{x€l:XfiO,  1  arg  x  |  <  n }  and  for 

suitable  6(x,t)  is  a  holomorphic  function  of  t  C  {t€  I  : 

Ret  >  O}  if  3=  0,  oft€Hif  3<0,  and  of  t  €  K  if  3  >  0. 

Termwise  dif ferentation  is  justified. 
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Remarks 


a.  Because  of  periodicity  for  B  0,  it  is  sufficient  to  consider  only 
the  behavior  of  e(x,t)  in  the  periodicity  strip  Re  o  6  Im  t  =  w  g 


The  sets  H  and  K  of  the  theorem  are  the 


unshaded  domains  shown  in  Figures  1  and  2,  respectively. 


t 
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b.  The  different  behavior  of  0(x,t)  as  a  function  of  t 

relative  to  the  values  of  the  drift  parameter  3  is  not  surprising. 
First  of  all,  the  different  structure  of  the  function  b(t)  for 

3  =  0  and  for  3  0  as  shown  in  (1.4)  should  be  observed.  Secondly, 
it  is  important  to  note  that  3  >  0  as  compared  with  3  <  0  results 
in  a  drastic  change  in  the  behavior  of  the  distribution  process 

v*(x,t;y),  y>0.  If3>0,  b(t)f[Qi(l-X)3*^]^^^^  >0 


as  1 1  ,  which  is  to  say  that  the  distribution  process  approaches 

a  steady  state  as  tf  This  is  in  contrast  to  the  case  3  1  0  in 
which  b(t)t  00  as  tt 

Theorem.  9(x,t)  is  a  particular  solution  of  equation  (2.1). 

VI .  PROBABILISTIC  CONSIDERATIONS 
The  real  valued  function  of  the  real  variable  x 


F(x) 


Y(1+q.^^  x>0,C=  xb“\  b>0,X<  1,  p<l, 

q  =  (X-p)(l-X)-‘'  >  -1, 


0  ,  X  <  0  , 


where  Y(a,y)  is  the  incomplete  Gamma  function,  may  be  designated 
a  probability  distribution  function.  The  nondecreasing  function 

F(x)  has  the  properties  F(x)>0,  F(x)'l'0  as  x  ,  F(x)'M 

as  X  t".  Consequently,  the  real  valued  function  of  the  real 
variable  x 


(|)  (x) 


clF(x)  ^  1  -  X 

dx  r(l+q) 

0  ,  X  <  0  , 


b-’  rp 


exp  - 


X  >  0 


(6.1) 


represents  a  probability  density  function.  One  recognizes 
immediately  that  e(x)  coincides  with  the  delta  function  initial 
condition  solution  (3.2)  of  the  equation  (2.1)  if  t  is  being 
considered  as  a  (positive)  parameter. 
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The  probability  density  function  <t)(x)  given  by  (6.1)  contains  the  following 
well-known  functions  as  special  cases: 

a.  the  Gauss  (normal)  pdf 


2_  1,-1  ^-(x/b)2 


for  X  =  -1 ,  p=0. 

b.  the  Weibull  pdf 
(l-X)b”^ (x/b)'^ 


-(x/b)^~^ 


for  p  =  X . 

c.  the  negative  exponential  pdf 

-1  -x/b 
b  e 


for  X  =  p  =  0  (special  case  of  Weibull). 

d.  the  Gamma  pdf 


TTT^ 


1  b"Vx/b)'P  e 


-x/b 


for  X  =  0. 


e.  the  Rayleigh  pdf 
2b-'(x/b) 

for  X  =  p  =-l  (special  case  of  Weibull). 

f.  the  Maxwell  pdf  (Maxwell  distribution  for  the  absolute  value  of  velocity) 


4 


b'^ (x/b)^ 


g-(x/b)2 


for  X  =  -1 ,  p  =  -2. 
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g.  the  Wien  pdf  (Wien  distribution  for  frequency,  limiting  case 
of  Planck’s  distribution  for  high  frequencies) 

1 


for  X  =  0,  p  =  -  3. 


It  is  useful  to  investigate  the  characteristic  function  associated 
with  a  probability  density  function.  For  the  function  (6.1)  it 
is  defined  as 

^  ^  J  ^  ^  ®  ^  e^^dx  ,  s  =  a  +  io), 

0 

or,  upon  the  substitution  x  =  bC, 

00 

"  rfi+q^)  .  (6.2) 

The  powers  of  5  are,  of  course,  defined  in  terms  of  log^  with  the 
principal  value  for  log  ^  for  ^  >  0. 


The  transformation  (6.2)  defines  $(s)  as  a  holomorphic  function 
for  Re  s<0  if  0<X<1,  for  Re  s  <  b"^  if  X  *  0, 

and  forsfitif  X<0,  i.e,^  for  X  <  0^  ^(s)  is  an  entire  function 
of  order  p  =  X~^(X-  1),  1  <  p  <  <»  . 


One  has  the  power  series  expansion 

*(s)  .  I:  sn 


n=o 


with 


“n  =  r(1+q+n(1-X)~1 

r{l+q) 


(n=0,l,2,...) 


(6.3) 


being  the  moments  of  the  probability  density  function  (6.1).  They 
exist  for  all  values  of  X  <  l.  However,  for  O^X  <i,  the  series 
evidently  does  not  have  a  positive  radius  of  convergence. 

For  X  =  0,  the  series  (6.3)  has  the  convergence  radius  b~^  >  0  and 
it  represents  the  function 
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^  (s)  =  (1-bs) 


(P  <  1) 


P-1 


for  |s|  <  b~^ .  Clearly,  for  X  <  0,  the  convergence  radius  is  +  “  . 

In  applications,  it  is  useful  to  identify  the  parameter  b  appearing  in 
(6.1)  with  the  function  b(t)  as  defined  by  0*^)  to  have  the  full  set  of 
parameters  a>0,B€nR,  X<1,  p<l,  t<0  available. 
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ON  THE  NUMERICAL  SOLUTION  OF  A  STOCHASTIC 
OPTIMAL  CORRECTION  PROBLEM* 

P.  L.  Chow  and  J.L.  Menaldi 
Department  of  Mathematics,  Wayne  State  University 
Detroit,  Michigan  48202 

ABSTRACT  The  numerical  solution  to  an  optimal  correction  problem 
for  a  damped  random  linear  oscillator  is  studied.  A  numerical 
algorithm  for  the  discretized  system  of  the  governing  variational  Inequalities 
will  be  given.  To  initiate  the  computation,  we  adopt  a  numerical  scheme  for 
the  deterministic  version  of  the  problem.  This  will  be  followed  by  an  algor¬ 
ithm  based  on  a  discrete  maximum  principle  to  ensure  the  convergence  of  the 
interation  process. 

I.  INTRODUCTION .  In  a  previous  paper  [IJ,  we  consider  the  control  problem  for 
the  damped  linear  oscillator  excited  by  a  random  noise 

r  X  +  px  +  X  =  rw  +v  ,  o<t<T, 

\  . 

I  x(o)  =  x^  ,  x(o)  =  y^ 

where  p,q  are  the  damping  and  spring  constants,  and  denote  the  initial 

position  and  velocity,  respectively;  r  is  the  intensity  of  the  white-noise 
the  control  momentum  at  t  ,  and  T  the  horizon.  Settings  y  =  x  , 
Equation  (1)  may  be  rewritten  in  the  Ito-diff erential  form  by  a  change  of  time 
scale : 


*This  work  has  been  supported  in  part  by  the  ARC  Contract  DAAG29-83-K-0014. 
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(2) 


dx  =  y  ds  , 
s  ■'s’ 


“  “(py  q  X  )ds  +  rdw  +  dv  , 


dt  =  ds  , 
s  * 


=  X  .  =  y  ,  =  t 


For  the  admissible  set  of  control  ,  we  take  the  set  of  all  processes 
of  bounded  variation,  which  depend  on  the  Wiener  process  '  in  a  nice  way- 
Also,  for  simplicity,  we  assume  the  average  cost  function  being  of  the  special 
form 

J^(x,y,t)  =  E  {  f(x^,y,j,)  +  c|v^|}  (3) 

where  f  is  smooth  and  of,  at  most,  pol3momial  growth;  c  >  o  is  a  constant, 
and  [v  I  is  the  sum  of  the  positive  variation  and  the  negative  variation 

t  t 

of  ,  If  we  denote  by  u  the  minimum  cost  function 

u(x,y,t)  =  inf  J  (x,y,t)  ,  (4) 

V  V 

then  we  can  show  that  it  satisfies  the  variational  inequalities: 


1  (a) 

^  + 
at 

Lu  0  , 

/  (b) 

1  du 
9y 

1  <  C  , 

(5) 

^  (c) 

Z'  du 

vat 

+  1  - 

)  = 

0  ,  u(x,y,T)  =  f (x,y)  . 

0  <  t 

<  T  , 

-  «>  <  x,y  <  “  , 

and 

Lu  « 

2  “  (2py+q  x) 
ay 

ay 

au 

(6) 

To  solve  the  problem  numerically,  we  must  replace  the  unbounded  domain  in 
the  x-y-t  space  by  a  rectangular  box  Q^={(x,y,t)  in  m®  :  [x|  <  2^  , 
|y|^£2>  .  Then  we  introduce  a  finite-difference  approximation 
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to  the  differential  inequalities  in  (5)  and  some  appropriate  boundary  conditions 


on  the  boundary  surface  of  .  To  this  end  we  let  Ax  - 


and  At  =  T/N  for  some  fixed  integers  M,N  >  o  .  Let  Q. 
of  mesh  points  in  ,  that  is 

Vn  ■  ‘K’-V'n’  •  =  1^*  .  yj  =  Jiy  . 


^M,N 


t^  =  nA  t  ;  i,  j  =  o,±l,  •  .  .  ,±M  ,  n=o,l, .  . .  ,N  }  . 


/M  ,  Ay  = 
denote  the  set 


The  pivotal  value  of  u  at  a  mesh  point 


=  u(?)  =  u(x  ,y  ,t  ) 

1  J  n 


is  given  by 


In  what  follows  we  shall  present  two  numerical  procedures  for  solving  the 
problem  (5)  corresponding  to  two  different  finite-difference  schemes  for  the 
variational  inequalities.  The  first  procedure  is  based  on  a  deterministic 
version  of  the  problem,  while  the  second  procedure  deals  with  the  full  problem 
directly.  For  convenience,  they  will  be  called  the  first-order  and  second- 
order  methods,  respectively.  The  former  method  is  simple  and  explicit,  but 
less  accurate.  The  latter  is  an  implicit  scheme  whose  convergence  can  be 
proven  by  means  of  a  maximum  principle.  It  seems  that,  by  using  the  first- 
order  procedure  to  obtain  an  initial  approximation,  the  rate  of  convergence  by 
the  second-order  procedure  can  be  increased  considerably. 

We  remark  that  the  analysis  of  the  control  of  the  system  (1)  follows  clbsely 
the  techniques  introduced  in  our  earlier  work  [27],  {3].  Some  of  the  numeri¬ 
cal  techniques  has  been  adopted  from  the  work  of  Gronzalez  and  Rofman  [4]. 


1 1 ,  A  F I  PA  1  - 1'  1.  D  F.  F  J 1 T  JMb  P 1 L  A1  -  T  hG_p_ .  T  o  a  v  o  i  d  u  nn  ecessary  algebraic  c  omp  1  i  c  a  - 
tlons,  we  set  p  =  o  in  Eq,  (1> .  Then  we  introduce  a  finite-difference 
a ppr ox ima 1 1  or.  t o  t h c-  in ec ua  ]  i  t y  ( 5  ,  a)  as  f  o  1  lows : 
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1  /  n+1  \  o  j.  n.+i  . 

At  ^“l.j  '  “l,j>  *  2(A5r)“^"l,J+l  ■  *  “l,j-l’ 

+  111  g  [hj(u^^,_.-u”_j)  +''j(V.,j-'‘i,J*l}  ■ 

1  ,  if  i  >  o 


where  h ,  = 

1 


o  ,  if  i  <  o  ,  and  =  (1-h.) 

11. 


(7) 


By  using  the  forward  and  the  backward  differences,  the  inequality  (5.b)  yields; 


.  -  u”  .  ,  <  cAy  , 

1,1  1,1-1 

u,  .-u.  cAy. 

1,1  1,1+1 


(8) 


The  expressions  (7)  and  (8)  can  be  rewritten  as: 


n 


uY  .  <  a..(h.  u?  .  +  h'.  u? 

1,1  11  1  1,1-1  1  1,1+1 


+  b .  .  (h .  u^j^,  .  +  h .  . ) 

ij  1  i+i,j  1  1-1  ,j 


,  ,  n+i  n+i  n+:  . 

+  c. . (u .  . .  -  2u.  .  +  u.  .  )  , 

iJ  1,1+1  1,1  1,1-1 


(9) 


n 

u.  . 

< 

cAy  , 

(10) 

n 

H  A. 

u.  .  ,  + 
1,1-1 

cAy  , 

(11) 

where 


^ij 


q  i  Ax 


d .  .Ay 
11 


i  I  Ay 


r2 


,  c , ,  = 


d_Ax  2d^j(Ay) 


Ay 

Let  ,  A2  ,  A^  denote  the  right-hand  sides  of  (9),  (10)  and  (11), respectively . 
Then  the  system  of  variational  inequalities  become 
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max 

Kk«3 


u,  .  ^  A,  u*  , 

1,3  ^  \  1,3 

r  1,J  \  1,3  'I 


N  -  f 
u.  .  =  f.  . 
i.J  1.J 


“i.J  ■  8i.j  •  • 


k=1.2,3  . 


0,  n“o,l,,.*,N;  i ,3=0 ,±1 , - - - ,±M  , 


(12) 


Here  the  last  condition  is  an  additional  boundary  condition  needed  when  we  re¬ 
place  the  unbounded  domain  by  a  rectangular  box.  The  first  numerical  algorithm 
is  given  as  follows: 

1. )  Set  n  =  N  and  ,  =  f,  .  ,  for  i ,  j=^0,±l ,  •  .  -  ,±N  , 

1,1  1,1 

2. )  For  (i  -  "M  ;  j-o,-! , , , . ,-M) , 

(i  =  M  ;  j-o,  1  M), 

(i  “  o  ,  -1,...,-M;  3  =  M) , 

set  u^^  .  =  g,  .  ,  with  11=0,.,  —  ,N-1  . 

1,1  1,1 

3-)  For  n-N-1  ,  calculate  u?  .  by  the  following  steps: 

1,1 

(a)  Set  \  for  (i-o ,  1 ,  .  .  .  ,M;  j-o)  .  From  (12),  compute 

1,1  1,1 

^  =  min  {  A  u]  .  )  for  j=o,l,  .  *  .  ,M  and  i=M,M-l,  *  .  .  ,0  . 


J.  ,  ,  I  i  •.  1^  . 


(b)  Knowing  u.  .  for  (i-o;  j=o,...,M)  and  (i=o ,-l , . . . ,-M;  3=M)  , 

1,1 

compute  uV  .  as  before  for  i=-l,.,,,-M  and  j -o ,-l , . . . , -M  . 

1,1 

(c)  Knowing  lu  .  for  (i=o,-l, — ,-M;  j=o)  and  (i=-M;  j=o ,-l , . . . ,-M)  , 

1,1 

find  uV  .  for  j-o,-l , .  .  .  ,-M  and  i--M,  -M-fl ,  .  . .  ,0  . 

1,1 

(d)  Knowing  1/.^  .  for  (  j=o  ,-l , .  .  .  ,-M)  and  (i-o ,  1 , ,  .  .  ,M;  j=-'M)  , 

*  1 , 

find  -  for  l  =  (),  *  , .  .  *  ,M  and  j--M,-MH-l,  ...  ,0  . 

>  J 

A.)  Check  if  the  compiiied  values  of  .  for  (i=o ,  I ,  .  . .  ,M;  3=0)  agree  with 

^  ,  .1 

ilint  of  the  initial  guess.  If  yes,  go  to  n.).  Otherwise,  go  to  5,), 
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5.)  Adjust  the  initial  values,  say,  by  taking  the  mean  of  the  guessed  and  the 
computed  values  and  repeat  the  steps  (3. a)  ^  (3*d)  until  they  agree  with  a 
prescribed  error.  Then  go  to  6.), 

6-;  Replace  n  by  (n-1)  in  3-)'  Repeat  the  steps  3.)  to  6.)  and  stop  after 
n=o  . 

The  above  procedure  can  be  schematically  shown  in  the  foliowing  Figure  1. 


/I 

/ 

/ 

V 

/' 

P 

- ^ 

(3,2) 

/ 
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/I 
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Figure  1 

A  First-^Order  Numerical  Procedure 


The  above  procedure  is  explicit  in  the  sense  that  one  can  march  backward  in 
time.  However,  we  do  not  expect  a  fast  convergence  of  the  scheme,  if  it  con¬ 
verges  at  all.  It  is  possible  to  devise  a  modified  scheme  to  improve  the 
accuracy  of  the  method.  But,  since  this  procedure  will  be  used  only  for  the 
initial  computations,  we  will  not  do  so.  For  the  major  part  of  computations, 
the  next  algorithm,  will  be  adopted  to  ensure  a  proper  convergence. 
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III.  AN  ITERATIVE  METHOD  BASED  ON  MAXIMUM  PRINCIPLE.  In  the  left-hand  side  of 
(7),  if  we  replace  the  superscript  n+1  by  n  for  each  of  three  terms  inside 
the  second  parenthesis,  we  would  obtain  the  following  finite-difference  approxi¬ 
mation  to  (5.1): 

^  n-hl  .  ,  n  .  n  ^ 

u.  .  <  a  u.  .  +  a.,(u.  +  u.  .  ,) 

i,J  1  1,1  2^  1,3+1  1,3-1 

+  a  (h.u”  .  ,+  h'.  u?  )  (13) 

3'  1  1,3-1  1  1.1+1 

+  a  (h.  u?  ,  .+  h'.  u”  ,  ,)  , 

4  3  1-1.1  1  1-1  .1 

where 


1  r^  q  i  A 

n  =  — i a  =  - - -  a  =  -1 — * — 1“ 

1  a  At  ’  2  2a  (Ay)  ’  3 


a  =  i  +  1^'  .  +  q2  +  iii^  . 

At  2(Ay)^  ^  Ay  Ax 


a  = 

^4  a  Ay  * 

o 


Here  we  note  that  Ct,  's  are  positive  with 


and  they  depend  on  (i,j)  • 

For  convenience,  we  regard  the  triple  array  .  as  a  vector  u  (or  a 

^ » J 

tensor) *  Let 

|.  n  ,  (2M+-1  )x(2M+1  )x(N+l  ) 

be  a  vector  in  (2M+1)^  x  (N+1) ^dimensional  space  E  with  the  maximum  norm 


max  u .  . 

i.i.n 


Let  us  define  a  linear  operators  and  on  E  such  that 
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(M-u)?  . 
1  i.J 


(M-u)?  . 
2  1,3 


=  the  right-hand  side  of  (13) , 


A  ^ 

A-u.  .  , 
2  1,3 


A  " 

A-u,  A 
3  i,J  • 


where  and  are  given  as  in  (12). 

Q  on  E  by  components: 


In  addition,  we  define  the  operator 


(Qu)  -  min  {  (M,  u) 
l.<k«3 


By  the  above  definitions,  in  contrast  with  the  system  (12),  we  get  the  following 
discrete  approximation  for  the  problem  (5) : 


(a)  u<M|^u,  k=l,2,3 


(b)  Qu  =  u  , 


(c)  uL„  =  f. 


(d)  u  =  g  for  1=±M  or  3=±M  , 


where, by  convention,  the  first  two  relations  hold  component-wise,  f  =  {f.  .} 

1  >  J 

and  g  =  {g  }  . 

^  >  J 

As  to  be  explained  later,  the  difference-inequality  (13)  yields  a  maximum 

principle.  Consequently  there  exists  a  unique  solution  to  the  problem  (19) 

which  can  be  constructed  by  successive  approximations.  This  forms  the  basis 

for  our  second  numerical  algorithm,  an  Iterative  scheme.  Given  in  E  , 

(k)  n  k 

we  denote  by  u^  =  {u.*.  }  its  k-’th  iterate.  The  iteration  procedure  runs 

1,1 

as  follows: 

1.)  To  start  the  process,  assume  the  values  of  are  given  so  that  they 


satisfy  (19. a, c  and  d) • 
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2,)  As  the  first  iterate,  set 


u 


(O 


{ u?  }  with 


{f_}  for  n^N;  i , 3=0 ,il ,  • . .  ,±M  , 


^,3 


{g.  for  n=o,l, * . . ,N-1 ;  or  j-iM  , 


(20) 


.  Otherwise  . 


3.)  Suppose  we  have  computed  u 
we  get 


(k) 


Then,  similar  to  the  above  computation. 


for  n-N;  i 

for  n-0,1. 

otherwise  , 

(21) 


4.)  Preassigned  an  acceptable  error  e  ,  The  iteration  process  terminates  at 
K-th  step  when  u^^  ^  ^  e  . 

By  the  properties  of  the  solution  of  (19)  ,  we  can  show  that  the  iteration- 
sequence  defined  above  converges  monotonically  in  E  to  the  solution 

u  from  below*  This  is  a  consequence  of  analytical  properties  of  u  to  be 
stated  in  what  follows. 


IV,  SOME  ANALYNICAL  RESULTS.  We  will  summarize  a  few  relevant  results  which 
enable  us  to  prove  the  existence,  uniqueness  of  a  solution  to  the  problem  (19) , 
as  well  as  the  convergence  of  the  iteration  method  proposed  in  the  previous 

section. 

First  we  state  the  announced  maximum  principle: 

(Rl.)  Let  u€E  satisfy  u  <  Q(u)  in  ,  the  interior  of  .  Then 

O 

u  cannot  attain  its  tnaximum  in  Qjy[  • 

As  an  immediate  consequence,  we  have 
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(R2.)  Assume  the  eKistence  of  a  solution  to  (19)-  Then  the  solution  must  be 
unique.  If  the  data  f  and  g  are  bounded  and  positive,  then  so  does 
the  solution  u  .  In  fact  we  get  o  ^  u  ^  max  {  ||  f  ||  ,  ||  g  ||  }  . 

To  prove  the  existence  of  a  solution,  we  need 

(R3.)  Suppose  u^  Q(u)  for  some  u£E  .  Then  the  vector  v  =  Q(u)  satisfies 
V  <  Q(v)  . 

(k) 

The  above  result  implies  that  the  sequence  u  of  iterates  defined 
in  (20)  and  (21)  converges,  that  is 
(  k) 

(R4*)  Let  {u  }  be  defined  as  in  (20)  and  (21)*  Then  the  sequence  is  mono* 
tonically  increasing  and  it  will  converge  in  E  to  the  solution  u  of 
(19). 

As  a  corollary  of  (R4.),  we  conclude  that 

(R5.)  The  iterative  numerical  method  for  the  discrete  problem  (19),  as.  des’^ 
cribed  in  §IV,  is  convergent  and  stable. 

V.  NUMERICAL  EXAMPLE.  As  an  example,  we  have  carried  out  extensive  numerical 

computations  for  the  special  case  when  the  damping  coefficient  p=0,  the  spring 

constant  q  =  0.2  ,  the  noise  level  r=0-2  and  the  unit  fuel  cost  c“0.2  * 

Also  we  chose  At  =  0.04  ,  N  =  4  and  Ax  =  Ay  =  0.  2  with  M  -  6  .  The  terminal 

value  f(x^y)  -x^+  ,  while  the  tolerable  error  e  =  0.5  10  ^  .  Under  these 

conditions,  it  is  found  that  the  iterative  method  described  in  §III  converges 

rather  rapidly-  Within  the  specified  error  e  ,  most  results  can  be  obtained 

in  less  than  30  iterations.  In  Table  1,  we  show  a  typical  set  of  computational 

results  for  the  optimal  cost  function  ^  .  The  results  are  grouped  in  three 

blocks  corresponding  to  n= 1,2,3.  In  each  block,  the  entries  in  the  horizontal 

directions  give  the  values  of  u^,  for  i  ranging  from  -6  to  +6,  and  the 

i  3 

vertical  entries  are  its  values  for  j-'-b,  -5,. ..,5,6  ,  Also  the  regions 
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corresponding  to  the  continuation  sets,  in  which  u  satisfies  =  u  ,  and 

their  complementary  regions  are  displayed  in  Figures  2-4,  corresponding  to 
n-  1,2,3.  In  the  figure,  the  continuation  set  is  marked  by  the  plus  (+)  signs, 
while  its  complement  by  the  star  (*)  signs.  The  curves  between  them  are  the 
free  boundaries.  These  figures  are  important  in  that  they  provide  a  control 
chart  to  implement  the  optimal  policy.  According  to  our  previous  result  [1], 
the  optimal  policy  is  to  do  nothing  if  the  initial  state  lies  in  the  continua¬ 
tion  set,  and  to  make  a  vertical  jump  to  the  boundary  otherwise.  Subsequently 
one  applies  a  control  only  when  the  state  reaches  a  free  boundary  by  a  vertical 
reflection.  In  view  of  the  figures,  we  observe  the  drastic  change  in  the 
shape  of  free  boundaries,  say  for  n  =  l  and  3  -  For  n-1  ,  the  initial 
correction  is  almost  one'^sided  for  saving  the  cost.  But  near  the  end,  n=3  , 
the  best  policy  is  a  two-sided  correction  to  bring  the  final  state  to  the 
origin  as  close  as  possible. 
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NUMBER  OF  ITERATION  TIMES: 
356  ARE  OBTAINED 

24 

X  1 — >- 

-0.0000 

0,2416 

-0,0000 

0,3616 

-0.0000 

1 . oooo 

-0.0000 

-0.0000 

NEW  VALUE  OF  U 
-0.0000  -0,0000 
-0.0000  1.0000 

-0 . OOOO 
0.8294 

-0.0000 

0.5525 

-0 . OOOO 
0.3556 

-0.0000 

0.2386 

-0.0000 

0.2016 

-0,0000 

0.5616 

-0.0000 

0.0415 

-0.0000 

1 .0000 

0.7894 

0.5125 

0,3156 

0. 1986 

6.  1616 

0.2016 

0.3216 

0,5216 

0.8015 

1 , oooo 

-0.0000 

-0,0000 

1 ,oooo 

O,  7494 

0.4725 

0.2756 

0. 1586 

0,  12  16 

0,  1616 

0.2816 

0,4816 

0,7615 

1  oooo 
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0.0816 
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0.3778 

0.  1871 
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0.0448 

0.0846 

0.2044 

0.4041 

0.6837 

0,9679 

-0.0000 

*  -0,0000 
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0.6437 
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0.  1644 
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0,0048 
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0,364  1 
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-brbbbb 
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0.9530 

-0 , 6666 

-0 . oooo 

0,9734 
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0,6670 

0.9448 
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-0 . oooo 

0.9756 

0.7637 

0.4841 

0. 2844 

0.  1646 

0.  1248 

0.1517 

0.2510 

0,4290 

0.6867 

0.9366 

-0 . oooo 

-b;b6bb 

0.8037 

0.524  t 

0,3244 

0.2046 

0.  1648 

0,  1899 

0.2839 

0,4558 

0,7073 

0^9273 

-6:6666 

-0,0000 

0,9796 

0.8436 

0.564  1 

0.3644 

0.2446 

0,2048 

0,2282 

0,3176 

0.4838 

0.7292 

0.9235 

-0.0000 

0 .  oooo 

-0.0000 

-0.0000 

-0.0000 

-0.0000 

-0.0000 

-0.0000 

-0.0000 

-0,0000 

-0 , OOOO 

-O . OOOO 

-0.0000 

-0.0000 
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Computed  values  of 


after  24  iterations 


Free  Boundai 


5 


References 


[1]  P.L.  Chow  and  J.L.  Menaldi,  Optimal  Corrections  of  a  Damped  Linear  Oscillator 
under  Random  Perturbations.  Trans,  of  the  Second  Army  Conf.  on  Appl.  Math, 
and  Computing,  (1984),  pp.  149-158. 

[2]  P.L.  Chow  and  J.L.  Menaldi,  On  the  Control  of  a  Linear.  Stochastic  System 
with  Finite  Horizon,  Trans,  of  the  First  Army  Conf.  on  Appl.  Math,  and 
Computing,  (1983),  pp.  301-310. 

[3]  P.L.  Chow  and  J.L.  Menaldi,  Additive  Control  of  Stochastic  Linear  Systems 
with  Finite  Horizon,  SIAM  J.  Control  and  Optlm. ,  to  appear. 

[4]  R.  Gonzolez  and  E.  Rofman,  On  Deterministic  Control  Problems:  An  Application 
Procedure  for  the  Optimal  Cost,  Research  Rept.  No.  151,  INRIA,  Le  Chesnay, 
France,  1982. 


54(; 


ERROR  ESTIMATE  FOR  THE  NUMERICAL  SOLUTION  OF  A  STOCHASTIC 


CONTROL  PROBLEM* 

P,L*  Chow  and  J.L.  Menaldi 
Department  of  Mathematics,  Wayne  State  University 
Detroit,  Michigan  48202 

ABSTRACT.  The  numerical  solution  of  an  optimal  stopping  problem 
for  a  diffusion  model  is  treated-  To  discretize  the  problem  we  use 
a  finite-difference  method  in  such  a  way  that  the  probabilistic  structure  is 
preserved.  That  is,  we  regard  the  discrete  system  probabilistically,  as  a  Markov 
chain  to  approximate  the  original  dif  fusion  process.  Thereby  the  strong  convergence 
and  the  error  of  approximation  are  obtained  for  the  numerical  solution  of  the 
optimal  cost  function. 

1.  INTRODUCTION.  We  are  interested  in  solving  numerically  the  Hamilton- Jacobi- 
Bellman  equation  arising  from  stochastic  control  problems.  As  a  first  step  we. will 
treat  a  relatively  simpler  problem  of  optimal  stopping  for  a  diffision  model. 

For  such  stochastic  control  problem,  one  is  referred  to  the  books  by  Bensoussan 
and  Lions  [1]  and  by  Shiryayev  [2],  among  others,  and  to  the  paper  by  Menaldi 
[3]  for  the  case  of  degenerate  diffusion. 

The  main  objective  of  the  paper  is  to  study  the  numerical  solution  of 
the  optimal  cost  function,  which  satisfies  a  variational  inequality  as  to  be 
shown.  In  particular  we  wish  to  get  an  error  estimate  for  the  approximate 
solution  by  using  a  finite-difference  scheme.  This  scheme  is  devised  in  such  a 
way  that  the  probabilistic  structure  of  the  problem  is  preserved.  It  will  be 
compatible  with  the  approximating  Markov  chain  model  to  the  original  diffusion 
process. 


*This  work  was  supported  by  the  ARO  Contract  DAAG29-83-K-0014. 
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The  idea  of  using  a  Markov  chain  approximation  is  not  new.  Among  others, 
Kushner  [A]  has  used  this  approach  extensively  in  the  context  of  stochastic 
control.  However,  he  adopted  the  method  of  weak  convergence  or  the  martingale 
formulation  in  the  approximation  theory.  In  constrast,  we  will  be  concerned  with 
an  error  estimate  for  some  approximate  solution  in  a  strong  sense.  As  far  as 
we  know,  this  kind  of  result  has  not  been  obtained  before. 


II-  DIFFUSION  MODELS.  The  state  of  the  dynamial  system  under  consideration  is 
governed  by  a  stochastic  differential  equation  of  It&  type.  It  reads 


(1) 


dy(t)  =  a  ry(t)]dt+  s  a  [y(t)]dW  (t)  ,  t  o  , 
o  K  K 


y(o)  =  X 

which  is  defined  in  a  q-*dimensional  Wiener  space  (n,F,F^,P,W(t)  ,t  ^  o)  ,  i,e., 
(n,  F>P)  is  a  completed  probability  space  and  W(t)  =  (Wj^(t)  ,k=l,  •  • .  ,q)  is  a 
q-dimentional  standard  Wiener  process  with  respect  to  {F^,t  ^o}  •  The  co¬ 
efficients  =  (a^^(x) ,i=l, . . • ,d)  ,  k-l,.,-,q  ,  are  given  Lipschitz  continuous 

functions  on  IR^  ,  i.e-. 


q 

(2)  E  ^  ^  c|x-x'|  ,  for  x,x'  in  ]R^  , 

k-o 

where  | ♦ |  denotes  the  Euclidian  norm  in  •  In  the  equation  (1) ,  x  is  the 
initial  state  at  the  time  t=o  • 

Suppose  that  the  only  control  we  have  on  the  system  is  to  decide  whether 
or  not  we  should  stop  the  evolution  of  the  state.  This  decision  should  be 
adapted  to  the  observation  of  the  state,  which  is  assumed  to  be  the  actual  state 
of  the  system  and  is  completely  observable.  Therefore,  if  r  =  t  (co)  ,  ooCfl  , 
is  the  random  time  at  which  we  decide  to  stop  the  system,  it  is  a  stopping  time 
with  respect  to  {F^,t  o}  ,  i*e., 
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(3)  for  every  t  o  . 

Note  that  t  ^  o  and  t  (CD)  may  be  infinite  for  some  cD€f2  . 

Associated  with  each  decision,  or  each  stopping  time  t  ,  we  introduce 
an  average  cost  functional 

(4)  J(t)  *  E{  ^f[y(t)dt  +  hty(t)]]!  (t  <  T)  }  , 

where  F  and  h  are  given  functions.  They  represent  the  unit  operating  cost 
and  the  cost  of  stopping  the  system,  respectively.  The  horizon  T  may  be  finite 
or  infinite.  Sometime  it  can  even  be  raiidom,  e.g., 

(5)  T  =  inf  {  t  0  =  y(t)^ 

which  is  the  first  exit  time  of  the  process  y(t)  from  some  closed  subset  O 
of  .  The  function  1  ( t  <  T)  equals  to  1  if  t<T  and  0  otherwise. 

The  control  problem  is  to  choose  t  so  that  the  average  cost  J(t)  is 

A 

minimal.  This  optimal  cost  u(x)  is  defined  by 

(6)  u(x)  =  inf  {  J^(t )  :  t  satisfying  (3)  }  . 


where  J  is  written  as  to  show  its  dependence  on  x  . 

Assuming  then  u  is  finite  and  smooth,  we  apply  the  method  of  dynamic 
programming  to  obtain  the  equation  for  optimality 


(7)  max{Au-f,  u-h} 


in 


6 


while 

(8)  Au(x)  = 


u(x) 


d 

S  a 

i=l 


(x)a,u(x) 

lO  1 


with  9  =  9/9x  ,  0..  =  9^/dx.9x.  .  The  equation  (7)  is  commonly  referred  to 

i  1  ’  xj  1  j 

as  a  variational  inequality.  In  case  that  T  is  given  by  (5),  we  must  add  to 
the  equation  (7)  the  boundary  condition 
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(9)  u 


o 


on  . 


Suppose  that  u  be  the  solution  of  the  equation  (7)  subject  to  the 

A 

boundary  condition  (9)-  Then  we  define  the  continuation  set  [u  <  h]  =  {xC©  : 
u(x)  <  h(x)  }  >  which  determines  the  optimal  stopping,  time  t  .  Since,  in  view 

A  A 

of  (7),  u  <  h  implies  Au  =  F  ,  we  have 
[u  <  h]  C  [Ati^  f]  , 

which  means  that,  in  the  continution  set,  it  is  cheaper  to  let  the  system  evolve 
freely  without  stopping-  But,  as  soon  as  the  state  leaves  the  continuation  set, 
we  must  stop  the  system  immediately  to  avoid  a  higher  cost. 

In  what  follows,  a  finite-difference  scheme  for  solving  the  variational 
inequality  (7)  in  ©  subject  to  the  condition  (9)  will  be  proposed-  Proba¬ 
bilistically  the  diffusion  process  will  be  replaced  by  an  appropriate  Markov 
chain  so  that  the  structure  of  the  problem  is  preserved.  As  mentioned  in  the 
previous  section,  our  real  interest  goes  beyond  the  present  problem.  Hopefully, 
this  approach  may  be  extended  to  treat  a  certain  class  of  nonlinear  problems 
for  which  the  coefficients  f  and  o  could  depend  on  a  control  parameter,  say 
a  ,  so  that  A  -  A (a)  and  the  equation  (7)  becomes 

(10)  max  {  max[A(a)  u  -  f(a)],u-h}  =  0  • 

a 

This  problem  will  be  studied  in  the  future. 


Ill-  MARKOV-CHAIN  MODELS.  Let  At  denote  a  small  unit  of  time.  The  state  of 
the  dynamical  system  at  the  discrete  times  nAt  ,  n  =  0,l,--.,  is  given  by 
Z(n,At)  ,  which  evolves  according  to  the  equation 


(11) 


Z(n+1) 

Z(o) 


q 

Z(n)  +  2  o^[Z(n)]^ 
k=o 


n-fi 

k 


X  , 
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where  • • • »!”)  with  Its  components  being  independent,  identically 

distributed  random  variables,  for  n=l,2,...,  and  ,i=l,2, . , .  ,d)  , 

k=l,2,.,.,q  ,  are  given  as  before. 

Here  a  stopping  time,  the  control  variable,  is  an  integer-valued  random 
variable  v  =  v(co)  satisfying  • 

(12)  {v^n)eF",  n=l,2 . 

where  is  the  o-algebra  generated  by  the  variables  ,  •  • .  • 

The  average  cost  associates  with  each  control  v  is  given  by 


(13)  J^(v,At) 


(vAN)-i  1 

2  f[Z(n)At  +  hlZ(n)ll(v  <  N)  I  , 
n=o  J 


where  the  horizon  N  is  a  positive  Integer  defined  by 

(14)  N  "  inf  {  n  o  s  Z(n)^d}  . 

Again  an  application  of  Dynamic  Programming  yields 

(15)  raax{  -  f  ,  u^  -  h )  =  o  in  0  . 


Here  we  set 

(16)  A^u(x)  =  i|u(x)  -  E[u(x  +  2  * 

*•  k*o  J 

A 

and  (x)  is  the  optimal  cost,  i.e., 

(17)  u^(x)  =  inf  {j^(v,At)  :  V  satisfying  (12)} 

The  boundary  condition  for  the  Equation  (15)  is 

(18)  =  o  on  di3  . 

Let  ^0*^1  positive  numbers  such  that 


Also  we  choose  the  probability  distribution  of 
the  following 


•  »^q)  to  satisfy 


and  have  disjoint  supports  for  kj^E  , 

(20)  (p{^;;»PAt}=a, 

'  o  o 

P{  =  ±  ^/l^t}  =  ,  for  k=l, 


Then  the  operator  can  be  expressed  as 


(21) 


A^u(x)  =  -^2  [u(x+>/  P^At  Cfjj^Cx))  -  2u(x) 

lC“*l 

a 

+  u(x-V  p^At  dj^Cx))]  -  [u(x  +  p^Ato^(x))-u(x)] 


Note  that  for  a  smooth  function  u(x)  we  have 


A^^U(X)  = 


-  d  q  /-I  ^  t 

-  “  2  S 

i,j=l  k-1 


I  Oj^(x))ds 


(22) 


-  2 
j=l 


OiQ(x)9iU(x  +  tPoAtOo(x))dt 


One  of  the  key  properties  of  the  operator  A^^  is  the  validity  of  the 
maxlmuni  principle  which  says 


(23) 


if  u(x)  attains  a  local  maximum  at  x  ,  then 

o 

A^^u(x^)  ^  o  for  a  sufficiently  small  At  . 


Now,  in  view  of  (22),  we  deduce  that,  if  the  second  derivatives  of  u  are 
continuous  in  a  neighborhood  of  x  , 

(24)  |A^u(x)  -  Au(x)  I  -i-  o  as  At  ->  o  , 


In  passing  we  remark  that  the  alternative  form  (21)  for  A  has  a 

&AL 

advantage  over  the  conventional  finite-differencing.  This  enables  us  to  reduce 
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the  number  of  coupling  among  the  equations  from  an  order  of  2  to  that  of  2d  . 
From  a  numerical  point  of  view,  this  reduction  is  significant- 

IV.  APPROXIMATION  RESULTS »  Having  described  two  stochastic  control  models  in 
the  previous  two  sections,  we  now  come  to  examine  the  approximation  problem- 
That  is,  if  we  approximate  the  diffusion  model  by  a  Markov  chain  model,  what  is 
the  approximation  error?  In  particular  we  are  interested  in  the  error  estimate 
for  computing  the  optimal  cost  by  a  finite-difference  scheme-  Our  approach  to 
this  problem  has  two  distinct  features-  On  one  hand,  we  exploit  the  analytical 
characterization  of  the  optimal  cost,  i-e.,  the  variational  inequality  or  theHamilton- 
Jacobi-Bellman  equation  by  replacing  the  differential  operator  A  by  a  finite- 
difference  operator  A^^  *  On  the  other  hand,  we  look  at  the  optimal  cost 
itself  through  an  appropriate  approximation  of  the  state  equation.  Specifically, 
the  approximation  involves  replacing  the  Brownian  motion  by  a  suitable  random 
walk. 

There  are  many  published  papers  concerning  the  above-mentioned  approxi¬ 
mation  problem*  But,  to  our  knowledge,  none  has  addressed  to  the  strong 
(versus  weak)  convergence  in  the  approximation.  Since  we  wish  to  eventually 
include  the  deterministic  control  problems  in  our  analysis,  it  will  be  necessary 
to  deal  with  controlled  diffusion  processes  with  possible  degeneracy.  This  is 
one  of  the  reasons  why  we  are  interested  in  seeking  an  approximation  that  will 
yield  the  following  kind  of  error  estimate 

(25)  E{  sup  1  y(t)-y^^(t)  |}  <  C  (At)^ 

0«t«T 

where  T  >  o  is  finite,  a  positive  constant  depending  on  a  and  T  ,  and 

{y^^t).t  ^  o)  is  an  approximation  process  of  {y(t),t  o  }  constructed  from 
the  Markov  chain  (11) - 
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We  remark  that  once  an  estimate  like  (25)  has  been  established,  a  cor¬ 
responding  estimate  for  the  optimal  cost  functions  (6)  and  (17)  follows 
Immediately.  Therefore,  we  shall  only  be  concerned  with  an  estimate  of  the 
type  (25)  by  giving  some  probabilitic  arguments. 

The  construction  of  the  process  {y  (t),t  ^o}  is  suggested  by 
Skorokhod  [5],  known  as  the  Skorokhod  representation.  On  the  same  Wiener 
space  where  the  stochastic  equation  (1)  is  based,  we  define,  by  induction, 
the  random  variables  t]’^  =  (tIq.tIi .  •  •  •  .n”  )  and  iS"  =  (»5^,d", . . .  ,,5"  )  as 
follows 

^k  ^  k=l,...,q, 

|w”(t)|-||“'*’|}  ,  n-o.l . 

(26)/w^«(t)  -  .  „=o.l . 


■ 


n+1 


‘‘k 


-I-  ...  -I-T.”  .  n=1.2.... 

K.  k  999 


and 


(27) 


n 


6^  = 


n 


n 

^o  ♦ 


t  +  . ..  -I-  t"  ,  n=l,2. 


It  is  possible  to  show  that  the  random  variables  t]”  have  the  same  pro¬ 
bability  distribution  (20)  for  the  random  variables  5*^,  n=l,2, _ ,  and  that 

(28)  e|  S  I  =  nAt  . 


Consequently  we  can  replace  the  equation  (11)  by 
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(29) 


^  n+l 

Z(n+1)  =  Z(n)  +  2  \[Z(n)]\  .  n=o,l,..., 

k=o 


Z(o)  =  X  , 


which  represents  the  same  Markov  chain. 


At . 


Now  we  define  the  approximating  process  {y  (t),t  ^^o}  by 


A  Q  Q 

(30)  y  ^(t)  -  Z(n)  if  2  O^(co)  t  <  2  (CO)  . 


k-o 


k-o 


By  construction  this  process  is  adapted  and  piecewise  constant  so  that  the 
equation  (29)  can  be  written  as 


y'^*^(t,co)  *  X  + 


(31) 


\9 


°  %[y^’'(s)]ds 


q 

+  2 


,n 


'I  r  At 

2  \[y  ''(s)]dWj^(s)  , 

=1  ''O 


if  2  ,9^«o)^t<  2  Ojj^^co)  . 
k=o  k=o 


With  aid  the  above  representation,  we  are  able  to  verify  the  following  result* 

Theorem:  Under  the  assumption  (2)  and  with  the  representation  (31),  for  any 
given  numbers  p  1  and  T  >  o  ,  there  exists  a  constant  C  =  C(p,T)  >  o  , 
depending  on  p  and  T  ,  the  same  constant  as  in  (2) ,  such  that 

(32)  e|  sup  |y^(t) -y^^t)|pi  <  C(l+ lx|P)(At)P^* 

lost^T  ^  ^  J 

for  any  x  in  and  o  <  At  ^  1  . 

Here  the  proof  of  the  theorem  will  be  omitted.  It  can  be  found  in  a  forthcoming 
paper  by  the  authors  [ 6 ] . 
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In  terms  of  practical  computation,  we  note  that  the  variable  x  in  the 
discretized  equation  is  still  continuous.  To  discretize  the  variable  x  ,  we 
may  proceed  as  follows.  First,  select  a  convenient  basis  {ej^(x)  ,k=l,2, .  * , ,  } 
in  a  suitable  function  space,  and  write 

oo 

(33)  uCx)-  Z  X^(u)ej^(x)  , 
k”! 

where  we  may  take  a  finite  sum  from  the  series.  Then  by  means  the  equation  (15), 

we  obtain  a  system  of  complementary  inequalities  in  in  the  space 

k 

TR  .  Getting  an  error  bound  by  using  (33)  is  much  easier  than  that  for  the 
previous  case.  For  instance,  if  we  choose  a  mesh  in  TR^  of  the  size  At  , 
i.e*,  X  =  iAt  ,  where  i  =  (ij^,,..,i^)  with  integers  as  components,  then  it 
is  possible  to  slightly  modify  the  Markov  chain  (29)  in  such  a  way  that  (32) 
is  preserved  and  the  restriction  of  (29)  to  the  mesh  is  still  a  Markov  chain. 

This  means  that  the  corresponding  variational  inequality  (15)  for  the  new 
Markov  chain  is  the  restriction  of  (15)  to  the  mesh.  Thereby  all  variables 
now  become  discrete  and  the  estimate  (32)  holds. 

A  paper  containing  the  details  of  the  above  arguments  is  under  preparation, 
and  the  generalization  of  the  present  work  to  other  types  of  control  problems 
is  a  subject  of  our  current  research. 
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FINITE  DIFFERENCE  METHODS  FOR  ELLIPTIC  SYSTEMS 


John  C.  Strikwerda 
University  of  Wisconsin-Madison 

ABSTRACT >  Ihis  paper  is  an  introduction  to  finite  difference  methods  for 
elliptic  systems  of  partial  differential  equations.  Elliptic  systems  arise  in 
many  areas  of  applications  such  as  incompressible  fluid  flow  and  elasticity* 
The  theory  for  elliptic  systems  is  reviewed  and  the  analogous  theory  for 
finite  difference  schemes  is  presented. 

I.  INTRODUCTION.  Elliptic  systems  of  partial  differential  equations 
arise  in  many  areas  of  science  and  engineering,  and  hence  it  is  important  to 
develop  numerical  methods  for  their  solution.  ‘In  this  paper  we  discuss  finite 
difference  methods  for  elliptic  systems,  presenting  both  theoretical  ressults 
and  results  of  sample  calculations. 

We  take  as  our  definition  of  elliptic  systems  that  given  by  Douglis  and 
Nirenberg  (3). 


Definition  1 . 1 

A  system  of  partial  differential  equations 


n 

(1*1)  y  £. . (x,D)u. (x)  = 

j=1  ^  ^ 

- 

i  “  1,  *•*,  n. 

where  D  =  (-i  3  ,  -i  3  ), 

*1 

is  an 

elliptic  system  if 

and  (T^)  ^2^  such  that 

1)  deg  A^j(x,5)  ^  Tj 

2)  there  are  positive  constants 

c  and 

R  such  that 

(det  £..(x,5)|  >c|c|^  for 

*  J 

Ul  >  : 

R  and  with  2p  - 

The  system  (1.1)  holds  in  a 

domain 

n  in  and  the 

n 


£#  are  continuous  on  the  closure  of  ft. 

Another  way  of  expressing  condition  (2)  is  to  let  the  suni 

of  terms  of  which  are  homogeneous  of  degree 

det  is  a  homogeneous  polynomial  of  degree  2p.  Condition  (2)  is 

then  equivalent  to  requiring  that 

det  l^j(x,e)  ^  0  for  C  0. 

Without  loss  of  generality  we  can  assume  that  a.  <0  and  x .  >  1 
for  1  <  i,  j  <  n.  ^  ^ 

We  now  present  some  examples  of  elliptic  systems  that  occur  frequently  in 
applications.  Our  first  example  is  the  Cauchy-Riemann  equations 
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3u  3v  _  * 
3x  "  3y  “ 

3u  ,  3v 
3y  ° 


with  5  0  and  =  1.  Also,  the  first-order  Poisson  equations 

are  el31ptic  with  =  Oj  =  -1,  <73  “  0  and  =  T2  »  1/  ^3  =  2  given 

that  (u,v,p)  =  (u^,  U2/  «3).  The  two-dimensional  equations  of  linear 
elasticity 

.  au  - 

‘11  ■  7;  ■  *’2  *  " 

^12  "  ^3  ay  ■  ^3  3x  "  ° 

.  au  3v  _  _ 

^12  "  ^2  ax  ~  ay  ”  ° 


are  an  elliptic  with  (t^^#  ^12'  ^22'  ^^2'  ^3^  ^^4' 

constants  P]^  are  given  by 

2,-1 

=  (1  -  h  ) 

^2  “  1  +  h)"^ 

P3  =  n<l  -  n^)“^ 

where  n  is  Poisson's  ratio. 

Die  final  example  is  the  Stokes  equations 

„2  ap  - 

,u-^=0 

72^  -  |E  .  0 

ay 

du  dv 
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where  0^=0^=  0,  O3  =  -1  and  =  2  and  T3  =  1  with  {u,v,p)  » 

Uj#  U3). 

II ♦  REGOIARITY  ESTIMATES.  Hie  prototype  of  all  elliptic  systems  is  the 
scalar  Poisson  equation 

(2.1)  V^u  =  f 

on  a  domain  ft.  The  classical  Schauder  estimates  show  that  the  order  of 
differentiation  of  the  solution  to  (2.1)  is  two  more  than  the  order  of 
differentiation  of  the  data  f  (Gilbarg  and  Trudinger  (5)),  This  increase  in 
the  differentiability  of  the  solution  over  the  differentiability  of  the  data 
is  a  characterizing  feature  of  elliptic  systems. 

For  the  case  that  the  elliptic  system  (1.1)  has  constant  coefficients,  we 
prove  an  interior  regularity  estimate  that  relates  the  orders  of 
differentiation  of  the  solution  to  that  of  the  data. 

Theorem  2. 1 
Let 


(a.)  and  (T.)  as  in  definition 
i  D 

R  )  for  any  real  number  r, 
moreover  there  exists  a  constant  C(r)  such  that 

'“'r+T  -  "'j'l+T.  •=  '”'0) 

]=1  ■’3  1=1  i 

=  C(r)(nfi^_^  +  hwBq)  . 

Proof 

Consider  the  system  (2.2)  written  in  matrix  notation 

L(D)w(x)  =  f{x)  . 

We  begin  by  using  the  fourier  transform  to  obtain  the  system 

A 

L(5)w(C)  =  f(5)  . 

By  condition  (1)  of  definition  1.1,  for  |c|  >  R  the  matrix  L(C)  can  be 
decomposed  as 

UK)  =  ur  L(c)ur 

where  L(  C )  is  bounded  in  norm  independently  of  |  C 1  and  1 5  1  and  1 1 ^ 

Cf  T. 

are  the  diagonal  matrices  with  entries  |c(  and  |cl  #  respectively. 

Moreover,  for  Uj  >  R,  L(C)  is  an  invertible  matrix  with  norm  bounded 
independently  of  | C |  • 


(2.2) 


I  t  (■»«  -  f.  , 

]=1 


be  an  elliptic  system  defined  on  with 

n“®i  ^  r+T. 

1.1.  If  e  H  (R  )  then  w^  e  H  ^( 
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Therefore ,  for  j  ^  j  >  R  we  have 

i5i^  w(o  = 

and  for  any  real  number  r 

w(5) « L(5)"‘'ur"‘'  f{€) . 

By  Parseval's  relation  we  then  easily  obtain 

I  l-lo) 

3=1  *'  3  i=1  i 

for  some  constant  C(r)  depending  on  L(5)  but  independent  of  f  or  w. 

This  completes  the  proof. 

Estimates  of  the  same  form  as  (2.3)  hold  for  elliptic  systems  with 
variable  coefficients.  The  theory  of  pseudo^differential  operators  can  be 
used  to  extend  the  ideas  of  the  above  proof  to  the  case  with  variable 
coefficients . 

From  the  regularity  estimate  on  R^  on  can  obtain  interior  regularity 
estimates  for  more  general  domains*  To  do  this  we  consider  the  system  (2.2) 

on  a  domain  R  in  Let  ^(x)  be  a  C  cut-off  function  which  is  unity 

on  a  domain  and  vanishes  off  a  domain  Rq  with  R^  C  R^  C  R.  The 

operator  L(D)  applied  to  the  function  u(x)  =  9(x)u(x)  gives 

(2.4)  L(D)u{x)  =  9{x)f(x)  -  M(x,D)u(x) 


where  M(x,D)  is  a  matrix  of  differential  operators  such  that  the  {i,j)-th 
operator  has  order  less  than  Moreover  M(x,D)  vanishes  outside  of 

Rq.  Ihus  the  system  (2-4)  can  be  considered  as  holding  on  I^.  The  estimate 

(2.3)  then  gives 


ITT  I 


where  C(r)  also  depends  on  9  and  its  derivatives.  Using  a  sequence  of 
sets  such  as  R^  and  Rq  we  easily  obtain  the  estimate 


This  interior  regularity  estimate  shows  that/  if  a  solution  to  the  system 
(2.2)  exists  on  the  domain  R/  then  the  solution  u  is  smooth  in  the 
interior  of  R*  'Rie  degree  of  smoothness/  or  differentiability/  is  dependent 
on  the  differentiability  of  the  data. 


III.  FINITE  DIFFERENCE  SCHEMES-  We  now  consider  finite  difference 
schemes  for  elliptic  systems.  We  begin  by  examining  the  use  of  central 
difference  formulas  to  approximate  the  Cauchy-Riemann  equations 
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(3.1) 


(3.2) 


3u  _3v 
3x  3y 


0 


3u 

3y 


=  0 


on  the  unit  square. 


2  2 

For  boundary  data  specify  u-x^-y  at  x=0  and  x  =  1,  and 
specify  v  -  2xy  at  y  =  0  and  y  “  1*  Using  central  difference 
approximations  along  with  one-sided  differences  to  (3,1)  at  the  boundaries  one 
obtains  as  the  solution 


(3*3) 


u.  . 
ID 


x2 

1 


+  e. 

j. 


’ij  - 

where  =  ih,  yj  «  jh.  and  h  =  1/(2  M) .  (This  example  is  discussed  in 
more  detail  in  Eube  and  Strikwerda  (2)). 

i  even 
i  odd 

Notice  that  the  solution  is  second-order  accurate,  but  not  smooth.  In 
particular,  the  second  divided  difference  of  Uj^j  with  respect  to  x  does 

not  converge  to  the  second  partial  derivative  of  u(x)  with  respect  to  x. 
This  nonconvergence  of  divided  differences  does  not  occur  for  the  usual  finite 
difference  schemes  for  a  single  second-order  elliptic  equation.  Bramble  and 
Hubbard  (1)  showed  that  for  solutions  of  a  finite  difference  scheme 
approximating  a  single  second  order  elliptic  equation,  the  divided  differences 
converge  to  the  corresponding  partial  derivatives,  provided  the  data  is 
sufficiently  smooth.  Moreover,  if  the  divided  difference  formulas  are 
accurate  enough,  the  rate  of  convergence  of  the  divided  differences  is  the 
same  as  that  of  the  solution  itself.  Similar  results  have  been  given  by  ThomS 
and  Westergren  (8),  Vainikko  and  Tamme  (11),  and  others. 


The  quantity  is  given  by 


As  the  example  shows,  not  all  consistent  finite  difference  schemes  for 
elliptic  systems  have  the  same  regularity  property  that  schemes  such  as  the 
standard  five-point  Laplacian  scheme  have.  Bube  and  Strikwerda  (2)  showed 
that  a  class  of  finite  difference  schemes,  called  regular  schemes,  do  have  the 
regularity  property.  These  regular  schemes  may  be  defined  as  follows. 

Definition  3.1. 

For  i,j  =1,  ...»  n,  let  be  a  difference  operator  with  symbol 

A^j(h,x,h).  The  system  of  difference  equations 


(3.4) 


I 

j=1  ^ 


(x)  *  f^(x)  i*  1,  •*.,  n  , 
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and 


is  a  regular  elliptic  system  if  there  are  sets  of  integers 
f  ^  n  11-1 

j'1  L^j  is  a  difference  operator  of  order  at  most 

and  if  there  are  positive  constants  C,  Hq  such  that 

(3.5)  Idet  t^^(h,x,C)l  > 

for  ^0  ^  ^  ^  h  0  <  h  <  0,  where  2p  =  E  +  S  T^. 

The  symbol  of  a  difference  operator  L  is  defined  by  L  e^^^  = 
£(h/X,h5)e^  i.e.  the  symbol  is  the  factor  multiplying  e^’'^  which 
results  from  applying  the  operator  to 


Oie  can  easily  check  that  the  finite  difference  scheme  for  the  Cauchy- 
Kiemann  equations  which  uses  central  differences  does  not  satisfy  the 
determinant  condition  (3.5).  The  symbol  for  the  central  difference  scheme  for 


(3*1)  is 


A 


sin  hC 


1 


V 


i  sin  hC , 


-i  sin  h? 


A 


i  in  hg 


V 


and  the  absolute  value  of  the  determinant  is  (sin^  h£^  +  sin^  * 

This  determinant  vanishes  for  \o),  (0,7rh~^),  and 

-1-1  12 

(irh  )•  As  shown  by  Bube  and  Strikwerda  (2)  the  loss  of  smoothness  in 

the  example  is  a  consequence  of  the  vanishing  of  the  determinant • 


There  are  several  regular  schemes  for  the  Cauchy- Riemann  equations.  One 
common  regular  schemes  is  the  staggered  grid  method  used  by  Qiil  and  Balgovind 
(4),  Lomax  and  Martin  (6)^  and  others*  A  regular  scheme  using  a  non-staggered 
grid  is  obtained  by  using  the  following  approximations 


3a  ~ 
3x  ‘ 


2 


3v  ~  .  h  .2  , 

3y  xO  6  y+  y- 

3u  **  *  *  .2 

3y  yO  6  y-f  y- 


3v  ^ 
3x  " 


5  *v 

xO 


X+  X- 


# 
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where  the  second  subscript  of  0,  +,  or  -  indicates  a  central,  forward,  or 
backward  divided  difference .  If  we  set 

C(C)  =  (sin(h5)  +  ~  h?>h"^ 

then  the  symbol  of  this  system  of  difference  equations  is 

i  5(5^)  -i 


i  C(52)  i 

and  the  determinant  is 

CU(Ct)|^  +  U(e2)l^] 

which  does  not  vanish  for  |c^|  +  zsJ^o*  Research  is  currently  being 

done  on  finding  an  efficient  algorithm  for  solving  the  system  of  finite 
difference  equations  resulting  from  this  approximation. 

A  regular  finite  difference  scheme  for  the  Stokes  equations  has  been 
proposed  by  Strikwerda  (9).  This  scheme  was  shown  to  be  second-order  accurate 
on  non-orthogonal  non-uniform  grids.  This  scheme  was  also  used  with  the 
incompressible  Navier-Stokes  equations,  Strikwerda  (10)  and  Nagel  and 
Strikwerda  (7). 

Currently  research  is  being  done  to  consider  the  regularity  at  the 
boundciry  of  the  domain  for  finite  difference  schemes.  It  should  be  possible 
to  determine  the  effect  of  the  nximerical  boundary  conditions  on  the  accuracy 
and  smoothness  of  the  solution  near  the  boundary.  It  is  also  important  to 
devise  better  methods  for  solving  the  finite  difference  equations  arising  from 
approximations  to  elliptic  systems . 
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GENERALIZED  ISOVECTORS  AND  SIMILARITY  SOLUTIONS* 


Frank  B.  Estabrook  and  Hugo  D.  ^ahlqulst 
Jet  Propulsion  Laboratory  169-327 
California  Institute  of  Technology 
4800  Oak  Grove  Drive 
Pasadena,  CA  91109 


ABSTRACT  The  manipulatory  techniques  of  differential  geometry 
allow  systematic  derivation  of  invariance  groups  of  sets  of  partial 
differential  equations.  For  each  generator  or  isovector  of  the 
algebra  of  such  a  group,  a  family  of  similarity  solutions  may  be 
found  by  integrating  a  set  of  equations  with  one  fewer  independent 
variables.  The  equations  for  the  components  of  the  Isovectors  them¬ 
selves  are  overdetermined  linear  partial  differential  equations, 
which  can  be  solved  algorithmically  by  symbolic  manipulation  pro¬ 
grams.  Such  families  of  similarity  solutions  can  be  generalized  by 
relaxing  the  definition  of  Isovector,  so  that  its  components  satisfy 
nonlinear  equations,  and  the  group  property  is  lost.  The  resulting 
"generalized  similarity  solutions"  nonetheless  can  be  useful  and 
physically  significant. 

I .  DIFFERENTIAL  GEOMETRY  AND  APPLIED  MATHEMATICS  Topics  such  as 
partial  differential  equations  (not  riecessarily  linear),  Hamiltonian 
mechanics,  variational  and  perturbational  techniques,  and  Lie  group 
theory  can  all  be  treated  at  a  deep  level  as  described  by  tensor 
Structures  on  differential  manifolds.  It  should  thus  be  stressed 
that  differencial  geometry  is  a  much  broader  discipline  than  that 
traditionally  taught,  which  over-emphasized  Riemannlan  geometry. 
In  general,  one  does  not  have  a  metric  tensor  gij,  one  cannot 
"raise  and  lower"  indices,  and  one  must  carefully  maintain  the 
distinction  between  "contravariant"  objects  such  as  vector  fields, 
and  "covariant"  objects  such  as  1-form  fields.  Without  a  metric, 
one  is  limited  in  differentiation  operations  to  the  generalized 
curl,  d  (applied  to  completely  antisymmetric  covariant  fields, 
n-forms),  and  to  the  "substantial"  or  Lie  derivative  operation 
along  a  given  vector  field.  The  inner  product,  or  contraction  of 
vectors  and  forms,  and  the  outer  or  antisymmetrized  tensor  product 
of  forms,  completes  the  set  of  operations:  d, 

suiting  "exterior  calculus"  has  been  expounded  in  a  number  of  texts, 
and,  in  our  opinion,  will  in  the  near  future  be  routinely  taught  at 
the  undergraduate  level.  We  recommend  a  new  text  by  William  L. 
Burke. ^  ^  ^ 

II,  CARTAN  ~  KAHLER  THEORY  Sets  of  first  order  partial  differ¬ 

ential  equations  are  treated  in  Cartan-Kahler  theory  as  defined  by 
ideals  of  differential  forms.  Such  an  ideal  is  generated  by  a  set 
of  forms  a^,  or  by  a  set  algebraically  equivalent  to  the  a^— 

i.e. ,  any  form,  say  B,  in  the  second  set  can  be  expressed  as  a  sum 
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over  all  forms  of  the  same  rank  in  the  first  ideal,  with  functions 
(or  scalar  fields)  as  coefficients,  and  conversely.  Any  form  in 
the  ideal  of  these  generators  may  be  so  expressed,  and  in  fact  the 
defining  property  is  that  all  the  forms  in  the  ideal  vanish  when 
pulled  back  (or  mapped)  onto  an  "Integral"  submanifold.  The  set 
should  be  completed  with  all  other  generators  having  this  property 
of  vanishing  on  any  of  the  Integral  submanifolds,  in  particular  the 
forms  d«i.  The  Integral  submanifolds  are  solutions  of  sets  of 
partial  differential  equations  which  appear  when  a  choice  of  inde¬ 
pendent  and  dependent  variables  is  made. 

Topics  such  as  the  symmetries,  potentials,  conservation  laws,  and 
so  on,  of  a  set  of  p.d.e.'s  can  all  be  treated  systematically  when  a 
closed  differential  ideal  can  be  found  to  express  it.  We  believe 
this  approach  to  be  particularly  useful  when  a  set  is  nonlinear  (or 
when  it  appears  to  be  nonlinear  in  the  variables  first  given). 
Consider  as  an  example  an  8  dimensional  space,  with  points  continu¬ 
ously  labeled  by  coordinates  or  parameters  U,  £1,  A,  B,  F,  G,  p,  t, 
and  in  it  an  ideal  I  generated  by  a  set  of  two  1-form  fields  and 
four  2-form  fields: 

dU  -  Adt  -  Bdp 
d£^  -  Fdt  -  Gdp 

dA-'dt  +  dB^dp 

dF^dt  +  dG'dp  (1) 

_7tT  9  9 

dB-.dt  +  dA-dp  -  [e  (F  -  G  )  -  B/p]dp-dt 

dG-'dt  +  dF-dp  -  [2(BG  -  AF)  -  G/p]dp-dt 

I  is  closed,  that  is  diCl.  Cartan's  theory^^)  allows  us  to  calcu¬ 
late  the  maximum  dimension  of  integral  submanifold  to  be  2;  if  we 
adopt  P  and  t  as  Independent  variables,  the  condition  chat  all 
(Che  generators)  of  1  vanish  reduces  to  a  set  of  p.d.e.'s  that  in 
fact  are  of  great  interest  in  the  theory  of  (nonlinear)  cylindrical 
gravitational  waves^^^; 
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SollCon  solutions,  and  many  beautiful  properties  have  recently  been 
discovered  for  this  set.  The  ideal  I  is  one  way  of  representing 
this,  that  allows  the  underlying  group  and  algebraic  structures  to 
be  uncovered. 

III.  CAUCHY  CHARACTERISTICS  When  presented  with  an  ideal  I  for 
analysis,  the  first  associated  structure  to  seek  is  a  vector  field 
V  that  has  Che  property: 


V  J  I  Cl 


(3) 


568 


If  it  exists,  such  a  field,  denoted  a  Cauchy  Characteristic  of  I, 
when  contracted  on  any  form  in  I,  yields  a  form  (of  one  less  rank) 
also  in  t.  These  equations  are  an  overdetermined  homogeneous 
linear  algebraic  set  for  the  components  of  V.  An  example  is  the 
field  of  flow,  or  set  of  trajectories,  that  belongs  to  a  Hamllton- 
Jacobl  Equation.  It  is  easily  shown  that  any  Cauchy  Characteristic 
vector  V  must  lie  in  the  Integral  manifolds  of  maximum  dimension, 
A  non-trlvlal  further  result  is  then  Car tan's  theorem,  that  for  any 
such  V  a  coordinate  transformation  can  be  found  that  eliminates  one 
(Independent)  coordinate  from  explicit  appearance  in  the  set  of 
generators  for  I.  The  solutions  thus  depend  on  one  less  Indepen— 
dent  variable  than  at  first  may  appear. 

IV.  TSOVECTORS  The  next  class  of  associated  vector  fields  are 
the  Isovectors,  or  generators  of  Lie's  infinitesimal  symmetries 
(the  same  as  treated  by  Ovslannikov  -  Ames,  Bluman  and  Cole,  etc.) 
In  our  language,  they  satisfy 


£v  I 


(4) 


and  can  readily  be  seen  to  form  a  group.  Now  one  must  Integrate 
a  homogeneous,  linear,  overdetermined  set  of  first  order  partial 
differential  equations  for  the  components  of  V.  This  can  be  done 
algorithmically  by  repeated  differentiation  (sici),  introducing  at 
each  step  more  functions,  of  fewer  variables.  Beautiful  and  effec¬ 
tive  symbolic  manipulation  programs  (in  the  language  REDUCE)  have 
recently  been  developed  for  this  at  Twente  University,  in  Holland, 
c.f.  recent  theses  by  P.  Gragert  and  P.  Kersten,  and  also  a  survey 
paper  with  their  professor  R.  Martini. 

V.  SIMILARITY  SOLUTIONS  We  can  now  precisely  define  a  simi¬ 
larity  solution  of  an  ideal  I  “  {<*^1,  belonging  to  an  Isovector 
V  of  I  (one  usually  writes  the  most  general  V  belonging  to  the 
entire  isogroup),  as  an  Integral  manifold  of  an  augmented  ideal 
I'  generated  by  forms  ul  and 

r  =■  {ui,  VJai}  (5) 


Assuming  that  I  was  complete,  that  is,  that  da^C  I,  It  is  imme¬ 
diately  calculable  that  I'  is  also  complete,  since  from  the  iden¬ 
tity  d(VJo^)  “  +  VJda^C  I'*  Moreover,  I"*  has  an  isovector, 

viz.,  V— we  have  been  able  consistently  to  Impose  it.  Thus  by 
Cartan's  theorem,  the  number  of  Independent  variables  in  I'  can 
be  reduced  by  one.  We  will  now  be  searching  for  a  (nonempty!) 
subset  of  solutions  of  I.  If  the  original  number  of  independent 
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variable  was  greater  than  two,  and  we  have  more  than  one  indepen¬ 
dent  isovector,  this  process  can  be  repeated  until  a  set  that  is 
readily  solvable  is  obtained,  or  until  a  set  of  ordinary  differen¬ 
tial  equation  is  at  hand*  Returning  to  the  above  example,  one 
verifies  that  the  following  is  an  an  isovector: 

V  =  4t  +  %  '  ®  If  ^ 

(This  could  easily  be  found  ad  hoc  by  any  applied  mathematician,  as 
it  generates  scaling  invariance,  so  we  repeat  that,  in  general,  one 
should  use  a  superposition  of  all  generators  of  the  isogroup!)  The 
Cartan  coordinate  reduction  allowed  by  Eq.  (6)  can  be  achieved  by 
introducing  new  Independent  variables  say  n  =»  p/t  (which  is  such 
that  fyti  a  0)  and  any  other  Independent  function  of  p,t.  The 
latter  then  drops  out  of  I"  and  one  finds  a  set  of  o.d.e. 's: 

U”(ti^  -  1)  +  U’(2n-^)  =  e'^^(fl')^(l  -n^) 

^  (7) 

n''(n^  -  1)  +  n’(2Ti  -  =  2U’fi'(n^  -  1) 


These  have  been  solved  by  quadrature  by  E.  Fischer. (5) 


In  the  same  paper,  Fischer  finds  another  reduction  of  Bq.  (2)  to  a 
set  of  o.d.e* 's.  As  is  in  fact  often  tried  in  relativity  theory, 
the  ansatz  that  U  and  0  depend  only  on  the  combination  n  P^  “  t^ 
also  works!  One  finds 


2nU" 

+ 

3U'  =• 

(8) 

2nD" 

11 

4nn’u' 

and,  again. 

these  are 

solvable  by  quadrature. 

But  the  resulting 

solutions  do  not  belong  to  an  infinitesimal  symmetry  of  1  as  given! 
What’s  going  on? 


VI.  GENERALIZED  SYMMETRIES  First,  it  must  be  said  that  the  des¬ 
cription  of  a  set  of  p.d.e.  's  by  a  differential  ideal  I  is  not 
unique*  It  may  even  be  possible  to  express  I  in  terms  of  fewer 
variables,  although  there  are  certain  criteria  for  I  to  be  "well 
set"  that  we  do  not  expound  here.  For  any  such  alternate  set,  a 
different  but  closely  related  ideal  can  result,  and  a  systematic 
understanding  of  how  its  Isogroup  is  built  into,  or  onto,  that 
first  found,  is  not  clear  to  us.  This  is  one  of  the  reasons  we 
only  claim  to  be  expounding  tools  for  applied  mathematicians. 

Prolongation  of  an  I  by  including  higher  partial  derivatives  has 
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been  considered  carefully  by  Anderson  and  IbragimovC^) ^  who  find, 
in  the  liinit  of  infinite  prolongation,  new  symmetries  of  "Lie-Back- 
lund"  type  related  to  sollton  transformations  and  other  newly 
appreciated  techniques  for  solution  generation. 


Prolongation  of  an  I  by  finding  conservation  laws,  and  generalized 
conservation  laws,  and  introducing  new  potential-like  variables,  has 
been  found  by  us^'^  to  be  equally  effective  in  finding  other  classes 
of  generalized  symetrles.  An  example  Is  the  Burgers  equation,  whose 
linearity  (an  infinite  number  of  isogroup  generators!)  is  not  found 
until  an  additional  potential  variable  is  Included  in  the  set  of 
p.d.e. ^s. 


Finally,  what  about  the  example  above,  at  the  end  of  section  V?  The 
ansatz  was  found  by  Fischer  to  result  from  a  "generalized  isovector" 
V  satisfying  the  equation: 


Mcr 


(9) 


This  is  still  sufficient  to  preserve  the  closure  of  I',  using  Eq. 
(5),  but  such  vectors  V  no  longer  form  a  group.  If  finding  similar¬ 
ity-type  special  solutions  is  all  one  wants,  this  is  in  principle 
no  problem.  What  may  be  a  practical  problem  is  that  the  equations 
(9),  (5),  for  V  are  no  longer  linear  in  its  components.  It  remains 
an  open  question  whether  such  generalized  isovectors  and  their 
similarity  solution  families  have  any  significant  relation  to  the 
other  approaches  to  generalized  symmetries  we  have  mentioned  above. 

*Res6arch  sponsored  by  the  U.S.  Army  Research  Office  through  an 
agreement  with  the  National  Aeronautics  and  Space  Administration. 
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APPLICATION  OF  RECIPROCAL  BACKLUND  TRANSFORMATIONS 
TO  STEFAN  PROBLEMS  IN  NONLINEAR  HEAT  CONDUCTION 

Colin  Rogers* 

School  of  Mathematics 
Georgia  Institute  of  Technology 
Atlanta,  Georgia  30332  USA 

ABSTRACT .  Reciprocal  Backlund  transformations  are  used  to 
investigate  both  one-phase  and  two-phase  Stefan  problems 
in  nonlinear  heat  conduction.  A  new  class  of  exact  solutions 
to  the  associated  nonlinear  moving  boundary  value  problems  is 
derived  which  is  analogous  to  that  obtained  by  Neumann  in 
linear  heat  conduction. 

I.  INTRODUCTION .  Storm  [1],  in  an  investigation  of  heat 
transport  in  simple  metals  showed  that  for  an  important  class 
of  such  materials  a  Backlund  transformation  may  be  introduced 
which  reduces  the  prevailing  nonlinear  heat  conduction  equa¬ 
tion  to  the  classical  1+1  heat  equation.  The  reduction  was 
used  to  solve  a  fixed  boundary  value  problem  involving  the 
temperature  distribution  in  a  half-space  with  an  insulated 
boundary.  It  has  been  shown  recently  that  the  Storm  trans¬ 
formation  may  be  set  in  the  context  of  a  class  of  reciprocal 
Backlund  transformations  which  allow  the  reduction  of  a  wide 
variety  of  nonlinear  boundary  value  problems  to  linear 

*Permanent  address:  Department  of  Applied  Mathematics, 
University  of  Waterloo,  Waterloo,  Ontario,  Canada. 
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canonical  form  [2,3].  Here,  both  one~phase  and  two-phase 
Stefan  problems  are  considered  for  materials  of  Storm-type. 

Such  moving  boundary  problems  arise  naturally  in  the  analysis 
of  melting  and  solidification  processes  [4].  Their  com¬ 
plexity  resides  in  the  fact  that  the  heat  balance  condition 
at  the  moving  interface  produces  a  nonlinear  boundary  condi¬ 
tion.  Here,  there  is  that  additional  complication  that  the 
heat  conduction  equations  considered  are  themselves  non¬ 
linear.  However,  with  attention  restricted  to  Storm-type 
materials,  it  is  shown  that  introduction  of  a  reciprocal  trans¬ 
formation  allows  the  construction  of  a  class  of  exact  solu¬ 
tions  analogous  to  the  classical  Nemnann  solutions  of  linear 
heat  conduction. 


II.  THE  RECIPROCAL  TRANSFORMATION.  In  what  follows,  we 
make  use  of  the  following  result  [5] : 

Theorem 

The  conservation  law 

~  {TO/3x;  3/3t;  u)  }  +  ^  {F(3/3x;  3/3t;  u)  }  =  0  (1) 

is  transformed  to  the  reciprocally  associated  conservation 
law 


3T*  3Fj^  _ 
3t*  3x* 

by  the  reciprocal  transformation: 


(2) 
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T*  = 


dx*  =  Tdx  -  Fdt,  t*  =  t 


F*  =  ~F(D*;9*;u) 
T(D* ;3*;u) 


T(D*;9*;u)  ' 


(3) 


where 


3  ;^*_3_F*3  a 

°  **”  3x  T*  Sx*"  '  *  3t  T*  ax*  3t* 


T(3/3x;  3/3t;  u)  s  T(u^ u.^ , ^xx ^ t '  ‘  ‘  ^ 


F(3/3x;  3/3t;  ii)  FCU  / u.^ / ^xx ' ^tt '  *  *  *  ^ 


The  reciprocal  nature  of  the  transfomation  resides  in  the 

2 

involutory  property  R  =1. 

In  particular,  the  above  result  shows  that  the  nonlinear 
equation 


n 


^  ^  [♦(u)  =  0 


(4) 


where  D  := 


■?:—  is  reducible  to  the  linear  canonical  form 


$ (u)  3x 

au* 

3t* 


3x^ 


n.  1  ^ 

r  r  3  U*  T  ^ 

[  Z  a.  - :r]  =  0 


i=l  ^  3x*^ 


(5) 


via  the  reciprocal  transformation 

n 

dx*  =  $(u)dx  +  (u)  2 

i=l 

u*  = 

In  the  sequel,  a  special 
the  analysis  of  a  class  of  Stefan  problems  in  nonlinear  heat 
conduction . 


(6) 


^>(u)  • 


case  of  this  result  is  used  in 
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III.  A  CLASS  OF  SINGLE  PHASE  STEPHAN  PROBLEMS  IN  NON- 


LINEAR  HEAT  CONDUCTION.  The  following  nonlinear  moving 
boundary  value  problem  is  considered: 


li  =  H  I?’' 

k(T)  =  U(t)  on  X  =  0,  t  >  0 
(3  X 


K (T)  1^  =  LpX(t) 


on  X  =  X(t) 


T  -  T 


X(0)  =  0‘ 


(7) 

(8) 

(9) 

(10) 


In  the  above,  T(x,t)  denotes  the  temperature  distribution  in 
a  medium  wherein  the  specific  heat  c  and  thermal  conductivity 
K  are  temperature  dependent.  The  density  p  of  the  material 
is  taken  to  be  constant.  L  denotes  the  latent  heat  of  fusion 
of  the  material  and  liberation  of  heat  is  envisaged  to  take 
place  during  a  phase  change  which  occurs  at  the  temperature 


If  we  set 


$(T)  -  /  S(ff)da,  S  =  pc  (T) 

T  ^ 

0 

then  the  nonlinear  heat  equation  (7)  may  be  written  as 


(11) 

(12) 


^  ^  (k(T)  111  =  0. 


<131 


The  reduction  of  the  previous  section  with 
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“i  = 


0  i  1 


i  =  1 ,  K*  >  0 


shows  that  (13)  is  taken  to  the  1+1-classical  heat  equation 


2 

3  T* 


via  the  reciprocal  transformation 


dx*  =  5>(T)dx  +  hc(T)  ^  dt,  t* 

d  X 


T*  = 


subject  only  to  the  condition 


$  (T) 


=  < (T) . 


The  applicability  of  the  condition  (17)  to  simple  metals  was 
discussed  in  [1] . 

Under  the  reciprocal  transformation  (16) 


=  <!>  (T)  , 


^  =  k(T)  ^  =  f  [k(t)  ^Idx  +  k(T)  ^ 

3t  3x  ^  3x  9x^^^  9x  x=0 


=  /  ['I'(T)]dx  +  U(t) 

0 


whence 


x*(x,t)  =  /  fdx  +  0(t)  -  0(0), 

0 


where  0(t)  =  U(t)  and  we  have  taken  x*(0,0)  =  0, 
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Accordingly,  the  fixed  boundary  condition  (8)  becomes 


9T* 

8x* 


=  -UT*  on  X*  =  0(t*)  -  e(0) 


Moreover, 


X 


[icfT)  -^l  dx  +  ic(T) 


X=x(t) 


=  /  ^  (T)  ]dx  +  LpX(t) 

X(t) 

SO  that  we  obtain  an  alternative  expression 


x*(x,t)  =  /  0(T)dx  +  [${T^)  +  Lp]X. 

X(t)  ^ 

Thus,  the  moving  boundary  conditions  (9)  become 


K*  =  -LpT*X 

T*  =  ^ 


on  X*  =  X* 


$  (T^) 


where 


X*  =  X* 


f=X(t)  "  +  Lp]X(t) 


SO  that  the  initial  condition  (10)  becomes 


X* 


t*=0 


Furthermore,  on  x*  =  X*  it  is  seen  that 


dX*/dt*  =  (l/T*)dX/dt  -  ic*T*^/T* 


so  that  the  heat  balance  boundary  condition  on  x*  =  x* 


(19) 


(20) 


(21) 


(22) 


(21) 


becomes 
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] dx*/dt* 


K* 


9T* 

ax* 


[ 


-LpT*^ 

1  +  LpT* 


-LpdX*/dt* 

$  (T^)  [$  (T^)  +  Lp] 


on  X*  =  X* 


(22) 


Thus,  to  summarize,  under  the  reciprocal  transformation  (16) , 
the  moving  boundary  value  problem  (7) -(10)  reduces,  subject 
to  the  Storm  condition  (17)  to 


aT* 

at* 


K* 


2 

a  T* 


ax*‘ 


t 


aT* 

ax* 


=  -UT*  on  X*  =  0(t*)  -  0(0) 


*  3T* 


LpdX*/dt* 


(23) 


ax’* 


T*  = 


$ (T^) [$ (T^)  +  Lp] 


^(T^) 


on  X*  =  X* 


X*(0)  =  0. 


Now, 


dx  =  T*dx*  +  k*T**  dt* 

X’' 

so  that 


=  T* 

ax*  ^ 


ax 

at* 


k*T^ 


X* 

/ 

X* 


a 

ax* 


(K*T**)dx*  +  ic*T** 

X*  X* 


x*=X* 


^f*  9T*  LpdX*/dt* 

at*  '  $(Tf)[$(T^)  +  Lp] 
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whence 


X* 

X  =  /  T*dx*  +  X*T*|  * 

X* 


LpX* 

$(Tj)  [$(T^)  +  Lp]  * 


Accordingly, 

x* 

x(x*,t*)  =  /  T*dx*  +  X*/[$(T^)  +  Lp]  . 
X*  ^ 

Alternatively, 


(24) 


3x 

9x* 


=  T* 


3x 

3t* 


X* 

/ 


3x* 


(K*T**)dx*  +  ic*T* 


^  |x=0 
X* 

f 

0  (t*)-0(O) 


3T^ 

3t^ 


X 


dx*  -  UT* 


X’ 


X" 


x=0 


x*=0(t*)-0(O) 


so  that 


X* 

x(x*,t*)  =  /  T*dx*  .  (25) 

0(t*)-0(O) 

Hence,  if  T*(x*,t*)  is  the  solution  of  the  reciprocal 
boundary  value  problem  (23)  then  the  solution  of  the  original 
problem  (7) -(10)  is  given  parametrically  by 


T  =  $"^(1/T*) 


X* 

X  =  /  T*dx* 

0 (t*)-0  (0) 


t  =  t* 


(26) 


while  the  evolution  of  the  boundary  x  =  X  is  given  in  terms  of 
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that  of  X*  =  X*  by  the  simple  relation  (22) . 

We  now  specialise  our  attention  to  the  class 
boundary  problems  (7)- (10)  with 

U(t)  =  Uq//E,  X(t)  =  /1ft 

and  introduce  the  similarity  variable 

=  x*//2Yt* 

into  the  reciprocal  problem  (23).  Solutions  are 
the  type 


so  that 


T*  =  (p*  (x*//2ft^) 


*  ^  +  K*  = 

J  ^  'I 


whence 


<p*  =  Aerf[^j^  K*]  +  B. 


The  linear  boundary  conditions  require  that 


K*  on  =  U^/T7f 


and 


(f)*  = 


f  (T^) 


on 


i*  =  $(T^)  +  Lp 


whence,  A  and  B  are  given  by 


-Uq/k*  Uq 

A,/—  e  =  -U.(Aer£[— 1  +  B)  , 

V  IT  °  /l^ 


of  moving 

(27) 

(28) 

(29) 

sought  of 

(30) 

(31) 

(32) 

(33) 

(34) 

(35) 
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(36) 


Aerf  [ 


(«(Tj)  +  Lo)]  +  B  =  . 


The  constant  y  which  determines  the  motion  of  the  boundary 
X  =  X(t)  is  obtained  from  the  remaining  boundary  condition 
which  provides  the  transcendental  equation 


A 


(0(T^)+Lp)^ 


-Lp/<D  (Tj)  . 


(37) 


The  solution  of  the  original  boundary  value  problem  is 
now  given  parametrically  by  the  relations 


T  =  $ 


-1 


[Aerf  (,/^  ^*)  t  B 


(38) 


5  =  f 


[Aerf ( 


2k* 


a)  +  B]da  . 


(39) 


It  is  noted  that  the  above  class  of  exact  solutions 
has  the  property  that  T  =  constant  =  Tq  on  the  boundary 
X  =  0  where 


$(T  )  =  - i -  .  (40) 

Aerf[UQ//i^]  +  B 

It  is  emphasised  that  the  above  analysis  is  valid  for  any 
member  of  the  class  of  materials  given  by  (17) .  It  repre¬ 
sents  an  extension  to  nonlinear  heat  conduction  of  the  clas¬ 
sical  Neumann  solution. 


IV.  TWO  PHASE  STEFAN  PROBLEMS  IN  NONLINEAR  HEAT  CONDUC¬ 
TION.  APPLICATION  OF  RECIPROCAL  TRANSFORMATIONS.  The  two- 
phase  Stefan  problem  to  be  considered  is  for  a  semi-infinite 
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region  x  >  0  with  phase  change  temperature  T^.  It  is  required 
to  determine  the  evolution  of  the  moving  phase  separation 
boundary  x  =  X(t)  and  teir^erature  distribution 


r 


T(x,t)  =  < 


T2(x,t)  >  0  <  X  <  X(t) 


^Tj^(x,t)  <  X(t)  <  X  <  <» 


(41) 


where 


!!i 

3t 


3T 


=  h  <  X  <  CO 


(42) 


•Cl  (TJ 


3T, 


l'"l'  3X 


3T 


=  LpX 


pc  o(T,) 


3T, 


p2'"2'  at 


^1  =  "^2  =  '^f 

rJ 


on  X  =  X(t) 


(43) 


0  <  X  <  X(t)  (44) 


3T 

'^2  ^"^2^  =  U(t)  on  X  =  0,  t  >  0 


(45) 


together  with  the  initial  conditions 


X(0)  =  0 


(46) 


T^(x,0)  =  Vq  <  ,  X  >  0 


(47) 


In  the  above,  the  T. (x,t) ,  c  . (T. ) ,  k. (T. )  i  =  1,2 

1  pi  1  11 

represent,  in  turn,  the  temperature  distribution,  specific 
heat  and  thermal  conductivity  in  the  two  phases.  The  sub¬ 
scripts  i  =  1,2  refer  to  the  new  and  original  phases  respec¬ 
tively.  In  this  problem  a  melting  process  is  envisaged  in 
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which  phase  1  is  solid  and  phase  2  is  liquid.  L  denotes  the 
latent  heat  of  fusion  of  the  medium.  Here  U(t)  denotes  the 
prescribed  flux  on  the  boundary  x  =  0  while  Vq  represents 
the  initial  temperature  of  the  medium.  It  is  noted  that  the 
analogous  two-phase  problem  in  linear  heat  conduction  has 
been  recently  investigated  by  Tarzia  [6] .  As  in  that  work  and 
in  the  single  phase  problem,  attention  is  restricted  to  the 
class  of  moving  boundary  problems  with 


u(t)  =  u.//t,  x(t)  =  /syt. 


(48) 

(49) 


If  we  now  set 

T.  =  $  (T.)  =  /  S. (a)da,  S.  =pc^. (T),  i  =  1,2  (50) 

X  X  -L  rp  -L  X  pi  X 

Oi 

then  (42)  and  (44)  yield 


T. 

1 


3^1  j-  9T^ 

"atT  “  93^  3x  ^  °  1  -  1,2  . 


(51) 


Our  attention  is  henceforth  confined  to  materials  for 
which  the  Storm  conditions 


=  k^(T^) 


i  =  1,2 


(52) 


apply,  where  i  =  1,2  are  positive  constants. 

Use  of  the  conditions  (52)  in  (51)  reduces  the  heat  con¬ 
duction  equations  in  the  two  phases  to  the  form 


aT^ 

TT 


-  K. 


_L 

ax 


1  *’'i 
_ ±.) 

i 


=  0,  i  =  1,2, 


(53) 


The  similarity  variable 
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5  =  x/  /2 


(54) 


IS  now 


introduced  and  solutions  of  (53)  are  sought  in  the  form 


Tf  =  (x/  /2  yt)  i  =  1,2 


(55) 


whence  (53)  yields 


d({) .  _  j 

Y?  -at 


1 

^  i)  =  0 


i  =  1,2. 


Under  the  reciprocal  transformation 


ae  =  dq 


R  (R^  =  I) 


())?  “  -T — 


(56) 


(57) 


(56)  produces  the  linear  canonical  form 

d^(})*  d(})* 

K.  - y  +  y5?  jTT  =0  i  =  1,2 


(58) 


with  solution 


i  =  1,2. 


(59) 


The  four  conditions 


=  T2  = 


9T, 


U 


ic^(T.) 


2  2  9x 


_0 

/t 


on  X  = 


on  X  = 


X(t) 

0,  t  >  0 


T^(x,0)  =  Vq 


produce,  in  turn,  four  equations  which  determine  the  A^, 
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i  =  1,2  namely 


(Tj)  ' 


A2erf [ 


ic, 

,\l-r 


Vt 


^2^  ®2  “  ' 


exp(-U^/K2)  =  -U^CAjerfCU^/^)  +  B^I 


\  ®1  <^i(Vq)  ' 


where 


(60) 

(61) 

(62) 

(63) 


\  =  q 


• 


^-1 


^2  =  ^2 


C=1 


(64) 

(65) 


The  interface  condition 
3T  3T 

“aiT  “  Tx  "  ^ 


yields 


d(()*  <2  d(|)^ 

^  "il  dq  " 


on  C  =  1 


whence 


-Al^l(Tf)yJ~ 

11  f  \ 


i  exp(-YAj/2Kj^) 


2  K. 

\ 

V  YTT 


^  exp(-YA2/2K2) 


=  Lp 


(66) 


The  latter  provides  a  transcendental  equation  for  the  constant 
Y  which  determines  the  motion  of  the  moving  boundary  x  = 

X(t)  =  /2^. 

The  required  temperature  distributions  T^^  and  T2  are 
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given  parametrically  by 


and 


^1  =  *1 


511  + 

q  _ 

i  =  f  {A,erf[  /-^  a]  +  }da  +  1 

X,  ^  V  2k^ 


(67) 


Tj  =  «2^{A2erft 


5| 


Y  >-  -i.  T  .  1 


•i-  5?]  +  B,) 


2  (c. 


(68) 


^  =  /  {A„erfl  /-^  a]  +  B  }da  . 

U„/27^  2  V  2-K2  ' 

The  quantities  X^  and  X^  are  given  by  the  relations 

Xi  -  X2  =  Lp  +  ^^(T^)  - 


(69) 


together  with 

X 

1  = 


Uq/27V 


A2erf [ 


a]da  +  B2  [X2 


U(,/2A1 


(70) 


References 

[1]  Storm,  M.  L.,  J.  Appl.  Phys.  940  (1951). 

[2]  Rogers,  C.,  J.  Phys.  A:  Math.  Gen.  L  493  (1983). 

[3]  Rogers,  C.,  J.  Math.  Phys.  2^,  393  (1985). 

[4]  Rubinstein,  L.  I.,  The  Stefan  Problem,  Trans.  Math. 
Monographs  ^  (Providence;  Amer.  Math.  Soc.)  (1971), 

[5]  Kingston,  J.  G.  and  Rogers,  C.,  Phys.  Lett.  A  9^,  261 
(1982)  . 

[6]  Tarzia,  D.  A.,  Quart.  Appl.  Math.  49^,  491  (1982). 


587 


ANALYSIS  OF  FLUID  EQUATIONS  BY  GROUP  METHODS 

1  2  3 

W.  F.  Ames  and  M.  C.  Nucci  ' 

School  of  Mathematics 

Georgia  Institute  of  Technology 

Atlanta,  GA  30332 


ABSTRACT.  Using  the  machinery  of  Lie  group  analysis 
several  equations  arising  in  fluid  mechanics  are  studied. 

In  particular,  the  Burgers*  equation  the  KdV  equation,  the 
Hopf  equation,  the  tv/o  dimensional  KdV  equation  and  the 
Lin-Tsien  equation  are  analyzed.  In  all  cases  the  parti¬ 
cular  group  includes  arbitrary  functions  of  time  which 
permit  the  transformation  of  time  dependent  equations  into 
the  corresponding  time  independent  ones.  Infinitely  many 
time  dependent  solutions  are  associated  with  each  steady 
solution.  Some  solutions  are  constructed. 

I.  INTRODUCTION.  Perhaps  the  most  widely  applicable 
method  for  determining  analytic  solutions  of  partial  dif¬ 
ferential  equations  utilizes  the  underlying  (Lie)  group 
structure.  The  mathematical  foundations  for  the  determination 
of  the  full  group  for  a  system  of  differential  equations 
can  be  found  in  Ames  [1] ,  Bluman  and  Cole  [2] ,  and  the  general 
theory  is  found  in  Ovsiannikov  [3].  The  determination  of  the 
full  group  requires  extremely  lengthy  calculations.  Detailed 
calculations  can  be  found  in  Ames  [1],  Ovsiannikov  [3]  and 
for  the  Navier  Stokes  equations  in  Boisvert  [4]  (see  also 
Boisvert,  et  al.  [5]).  Algebraic  programming  packages  for 
determining  these  groups  have  been  developed  by  Schwarz  using 
REDUCE  [9],  by  Roseneau  and  Schwarzmeier  using  MACSYMA  [10] 
and  CINO  in  Russia  (see  Ovsiannikov  [3],  p.  57).^  These 
programs,  while  very  versatile,  have  difficulties  in 
incorporating  arbitrary  functions  where  they  arise  in  the  Lie 
algebra.  These  arbitrary  functions  play  a  fundamental  role 
in  the  sequel. 

In  Boisvert,  et  al.  [5]  the  full  Lie  group  leaving  the 
Navier-Stokes  equations  invariant. 


^  n2 

u  +  uu  +  vu  +  wu„  =  -p^  +  yV  u,- 

t  X  y  z  A 


(1.1) 


^Research  supported  by  U.S.  Army  Grant  DAAG— 29-84— K  0083. 

^Permanent  address:  Dipartimento  di  Matematica,  Universita 
di  Perugia,  06100  Perugia,  Italy 

^Research  supported  by  a  NATO-CNR  fellowship. 
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2 

+  wv^  =  -Py  +  yV  V, 

(1.2) 

2 

+  ww_  =  -p  +  yV  w, 
z  z 

(1.3) 

+  w  =  0  , 

'  z 

(1.4) 

is  determined.  In  the  spirit  of  Lie  it  is  desired  to  find 
infinitesimal  transformations  of  the  form 


t' 

t 

+ 

eT(t,x,y,z 

,u,v,w,p) 

+ 

o(e^) , 

X' 

X 

+ 

eX (t,x,y,z 

,u,v,w,p) 

+ 

O(e^) , 

y' 

= 

y 

+ 

eY (t,x,y,2 

,u,v,w,p) 

+ 

O(e^)  , 

z' 

= 

z 

+ 

eZ (t,x,y,z 

,u,v,w,p) 

+ 

u' 

zs: 

u 

+ 

eU (t,x,y,2 

,u,v,w,p) 

+ 

O(e^) , 

v' 

= 

V 

+ 

eV (t,x,y,z 

,u,v,w,p) 

+ 

O(e^) , 

w' 

• 

w 

+ 

eW(t,x,y,z 

,u,v,w,p) 

+ 

O(e^)  , 

p’ 

p 

+ 

eP (t,x,y,z 

,u,v,w,p) 

+ 

0(£^)  , 

(1.5) 


which  leave  (1,1-1. 4)  invariant.  System  (1.5)  leaves  (1.1- 
1.4)  invariant  if  and  only  if  (u' , v' ,w' /p’ )  is  a  solution  of 
(1.1'-1.4')  whenever  (u,v,w,p)  is  a  solution  to  (l.l-jl.4). 

By  (1.1 '-1.4')  is  meant  the  same  equations  in  the  primed 
variables.  By  extensive  analysis  it  is  found  that  the  full 
Lie  group  leaving  (1.1-1. 4)  invariant  is  given  by  (1.5)  with 


T  =  a  +  23t, 

(1.6) 

X  = 

3x  -  yy  "  +  f  (t)  / 

(1.7) 

Y  = 

By  +  yx  -  az  +  g  (t)  , 

(1.8) 

Z  = 

3z  +  Xx  +  ay  +  h  (t)  , 

(1.9) 

U  = 

-Bu  -  yv  -  Xw  +  f ' (t) 

(1.10) 

V  = 

-Bv  +  yu  -  aw  +  g'  (t) 

(1.11) 

W  = 

-Bw  +  Xu  +  av  +  h'  (t) 

(1.12) 

+  j  (t)  -  xf "  (t)  -  yg"  (t)  “  zh"  (t) 

(1.13) 
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where  a/3/Y^X  and  a  are  five  arbitrary  parameters  and  f(t),  g(t), 
h(t)  and  j (t)  are  arbitrary,  sufficiently  smooth  functions 
of  t. 


The  arbitrary  functions  in  (1.7-1. 9)  permit  equations 
(1.1-1. 4)  to  be  transformed  into  their  time- independent  form. 
Thus  any  solution  of  the  steady  equations  generates  an  infinity 
of  time  dependent  solutions.  This  idea  was  exploited  in 
Boisvert,  et  al.  [5]  and  Nucci  [6]. 


II.  BURGER'S  EQUATION.  For  the  Burgers'  equation 

Ut  - 


(2.1) 


the  full  two  parameter  group  (a, 3)  with  one  arbitrary  func¬ 
tion  (f(t))  is 


T  =  a  +  2Bt,  X  =  3x  +  f  (t) ,  U  =  -3u  +  f ' (t) 


(2.2) 


With  a  =  If  3=0  the  subgroup 

T  =  1,  X  =  f  (t) ,  U  =  f  •  (t) 

has  the  generator 

0^  =  “• 
From  the  Lagrange  equations  of  (2.2)  we  have 

u  =  u  -  f  (t) 

X  =  X  -  F  (T)  , 


(2.3) 


(2.4) 


where  F'  =  f.  When  this  transformation  is  applied  to  (2.1) 
there  results 


uu- 

X 


^^xx 


(2.5) 


that  is  the  steady  equation.  One  integration  gives  the 
integrable  Riccati  equation 

U'  +  =  C,  (2.6) 


where  u  =  -2tiU.  Finally  setting  U  =  (2.6)  becomes 

—  Cij;  =  0- 

For  C  =  >  0  the  solution  of  (2.1)  is 
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u(x,t)  =  -2iia{  l-Rexp[-2a  (x-F  (t) )  ]  }/{  l+Rexp[-2a  (x-F  (t)  ]  }  +  f(t). 
2 

For  C  =  -a  <0  the  the  solution  for  (2,1)  is 


u(X/t) 


2ya 


cos [g  (x-F (t) )  ]  -  R  sin[a  (x-F (t) )  1  . 

sin[a (x-F(t) ) ]  +  R  cos [g (x-F (t) ) ] 


where  R  is  another  arbitrary  constant.  If  C  =  0  the  solution 
is 


u(x,t)  = 


2yR  -  (x-F(t)  ) 


+  f  (t)  . 


Ill,  THE  KORTEWEG  DE  VRIES  EQUATION.  Under  the  trans- 
forraation  (2.4)  the  KdV  equation  u^  +  uu^  =  '^xxx 

uu-  =  u — - 

X  XXX 

which  is  integrable  in  terms  of  elliptic  functions  since  one 
integration  gives 


U  -  U  =  C, 

XX 


when  u  =  2U. 


IV.  THE  EQUATION  u.  +  uu^  =  [(|)(u  )u  ]  .  Again  the 
action  of  (2.4)  transforms  the  equation  of  the  title  into 

V.  TWO  DIMENSIONAL  K-dV  EQUATION.  The  full  group  for 
the  two  dimensional  K-dV  equation  (Rogers  and  Chadwick  [7]) 


u  =  “U  -  gu  -  6u  -  6uu 

xxxx  xt  yy  X  xx 


(5.1) 


is  calculated,  using  the  notation  of  (1.5)  for  t,  x,  y,  and 
u,  to  be 

T  =  f  (t) 

X  ==  f  (t)x/3  -  f"(t)yV6a  -  g*  Ct)y/2g  +  h(t) 

Y  =  2f '  (t)y/3  +  g(t)  (5,2) 

U  =  -2f'  (t)u/3  +  f"(t)x/18  -  f"  (t)yV36g  -  g"(t)y/12a 

+  h'(t)/6. 

In  (5.2)  the  functions  f(t),  g(t)  and  h(t)  are  arbitrary  so 
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the  Lie  Algebra  is  infinite  dimensional  with  the  generator 

Moreover,  the  specific  choice  of  the  subgroup 
T  =  1,  X  =  -g'y/2a  +  h(t),  Y  =  g(t) 

U  =  -g"y/12a  +  hV6 

gives  rise  to  the  characteristic  variables 

y  -  y  =  G{t) 


(5.4) 


X  -  X  =  -gy/2a  +  ^{t) 
u  -  u  =  -g'y/12a  +  g^/24a  +  h/6  , 


(5.5) 


where  G'  =  g  and  fi'  =  <3^ /2a  +  h.  Under  (5.5)  equation  (5.1) 
becomes 


au--  +  6(u-)2  +  6UU--  +  u----  -  0, 


(5.6) 


that  is  the  time  independent  equation.  Each  solution  of 
(5.6)  gives  rise  to  a  family  of  solutions  involving  three 
arbitrary  functions  of  time,  f,  g  and  h,  when  u,  x,  and  y 
are  replaced  by  their  relations  from  (5.5).  To  illustrate 
this  idea  the  full  group  is  generated  for  equation  (5.6)  and 
some  exact  solutions  are  obtained  in  the  next  section. 

VI .  SOLUTIONS  FOR  EQUATION  (5.6).  The  full  group  for 
equation  (5.6)  (we  have  dropped  the  bars) ,  in  the  notation  of 
(1.5)  for  X,  y,  and  u  is 


X  =  c^x  +  C2 
y  =  2Cj^y  +  c. 
U  =  -2c^u, 


(6.1) 


v^ith  the  three  parameters  The  equation  for  the 

invariant  surface  are  obtained  from  (for  c^  0) 

u  =  F(n)/2cj^(2c^y  +  c^) 

.  1/2 

n  =  (c^x  +  c^)/{2c^y  +  c^)  / 

\1jj102re  F  satisfies  the  ordinary  differential  equation 


(6.2) 


c3p(iv)  2fp"  + 


aCj^n^F"  +  3(F')^  +  7c^anF'  +  Sac^^F  =  0.  (6.3) 
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(6.4) 


Two  solutions  of  equation  (6.3)  are 

F(n)  =  -*  4aCj^Ti^/3 


and 


F(n)  =  -4c^ti"^ 


(6.5) 


The  solution  of  equation  (5.1)  resulting  from  (6.4)  is 

,2 

h 


u  =  - 


2a{c^[x  +  ^  y  “  ^2^ 

3{2c^tY  -  G(t) ]  + 


and  that  resulting  from  (6.5)  is 

2 


2c; 


u  = 


{ci[x  +  ^  y  -  «(t)]  +  c^y 

2 


12^  y  +  ^  ^ 


V  +  ^  -  -I-  — 

12a  ^  24a  6 


where  =9  /24a  +  h  and  G*  ^ 

VII.  AN  INTERESTING  SUBGROUP  OF  (5>2)>  With  the  choice 
f(t)  =  t^,  g(t)  =  h(t)  =  0  in  (5.2)  the  group  becomes 

T  =  t^,  X  =  t^x  -  ty^/2,  Y  =  2t^Y 

U  =  -2t^u  +  tx/3  -  Y^/6a 


(7.1) 


From  (7.1)  the  equations  for  the  invariant  surface  are  found 
from 


n  =  y/t^f  5  =  x/t  +  Y^/2at^ 


(7.2) 


and 


u  =  5/6  -  n^t^/12a  +  F(n/?)/t^, 


(7.3) 


where  F(n/^)  satisfies  the  equation 


(7.4) 


But,  of  course,  this  is  equation  (5.6)  with  n  -  y^  5  -  x 
and  F  =  u.  Thus  two  solutions  are  available  -  i.e.. 


u(x,t)  =  - 


2  2.2 
2a[c^xt  +  Cj^Y  /2a  +  t  C2] 

3t^[2c3^Y  + 


and 
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[8] 


u(x,t)  =  ^  - 


2cy 


(c^xt  +  Cj^y^/2a  + 


VIII.  THE  LIN-TSIEN  EQUATION.  The  Lin-Tsien  equation 


2d)  +d)(j)  ~<t>  ~0»  (8. 

^tx  ^x^xx  ^yy 

where  tj)  is  a  velocity  potential,  has  been  used  to  study 
dynamic  transonic  flow  in  two  space  dimensions  (x  and  y) . 
The  full  group  is  known  (see  Ovsiannikov  [3,  p.  388])  to  be 

X  =  4ax/3  +  g'(t)y  +  w(t) 


i  ^  2a<p  +  2y^g"'/3 


+ 


Y  =  ay  +  g(t) 


(8.2) 


T  «  2at/3  +  B 

2y^w"  +  2xyg"  +  2xw'  +  yr(t)  +  s(t). 


Equations  (8.2)  contain  two  arbitrary  contants,  a  and  B, 
and  four  arbitrary  functions  g(t),  w(t) ,  r(t)  and  s(t).  Con¬ 
sequently,  the  Lie  Algebra  is  infinite  dimensional.  Once 
again  these  arbitrary  functions  will  permit  an  infinite  num¬ 
ber  of  time  dependent  solutions  to  be  generated  from  each 
steady  state  solution. 

The  invariants  of  the  group  that  are  constant  in  time 
(with  a  =  0)  are  found  as  before  to  be 

x-x=gy-A,  X  =  /(g^  +  w)dt 

y  -  y  =  Q,  Q  =  /g(t)dt 

and 

<J>  -  ^  =  2g'xy  +  2g''y^/3  +  2wx 

+  /{2g"[gy^  +  2gQy  -  Xy  +  gQ^  -  XQ] 

+  j  g'"  [3y  Q  +  3yQ  +  Q  1  +  2w '  [yg  +  gQ  -  X  ] 

+  2w"  [2yQ  +  Q^]  +  ry  +  s}dt 

It  is  easy  to  show  that  ^(x,y)  satisfies  the  time 
independent  equation 

(b  — (h— —  —  (b— —  =  0  (8.4) 

^x^xx  ^yy 

which  has  been  much  studied.  Given  any  solution  of  (8.4)  it 
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follows  that  time  dependent  solutions  (with  four  arbitrary 
functions  of  time  g,  w,  r  and  s)  are  constructable  —  thus 

<j)(x,y,t)  =  ^(x  -  g(t)y  +  X,  y  -  Q) 

+  (right  hand  side  of  the  last  equation  in  (8.3)). 
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F--P-S  POINCARE^LIKE  LINEARIZATION  APPLIED  TO  SOLITON  EQUATIONS 


R,L,  Anderson 
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Switzerland 

ABSTRACT .  The  program  of  E-P-S  Poincare-like  linearization  and  the 
results  of  its  application  to  soliton  equations  are  described. 

I.  Introduction.  The  results  announced  in  [l]  strongly  suggest  that 
soliton  equations  are  F-P-S  Poincare-like  linearizable  [5].  This  was  further 
supported  by  our  work  [2,3,4].  The  purpose  of  this  paper  is  to  describe 
heuristically  the  F-P-S  Poincare-^like  linearization  program  (§11)  and  some  of 
its  implications  (see  last  section  of  §11),  as  well  as  describe  the  results 
obtained  to  date  from  its  application  to  soliton  equations  (§III). 

One  important  reason  for  continued  physical  interest  in  soliton  equations 
such  as  the  Korteweg  de  Vries  (KdV) ,  nonlinear  Schrodinger,  Boussinesq,  and 
the  Benjamin-Ono  (B,0.)  equations  is  that  they  are  the  lowest  order  nonlinear 
equations  in  multiscale  approximations  to  the  equations  of  one-dimensional 
fluid  flow  for  various  physical  configurations  and  certain  initial  data.  They 
are  of  interest  in  their  own  right  because  they  possess  an  infinite- 
dimensional  Lie  symmetry  structure  which  can  be  used  to  generate  hierarchies 
of  constants  of  motion  and  classes  of  soliton  solutions.  They  exhibit 
particle-like  properties.  Further,  it  is  known  that  many  soliton  equations 
are  particular  cases  of  completely  integrable  infinite-dimensional 
Hamiltonian  systems  [6,7].  There  are  various  effective  approaches  to  studying 
such  equations;  e.g.,  1ST  Scheme  [8,9,10,11];  Riemann  problem  for  matrices 
[12];  Method  of  Prolongation  Structures  [13]  (the  reader  is  also  referred  to 
the  presentation  by  Estabrook  in  these  proceedings);  Kac-Moody  Lie  algebraic 
methods  [14],  (see  [14]  for  the  references  to  the  work  of  the  Kyoto  School.), 
loop  groups  [15],  Riemann  surfaces  and  theta  functions  [16].  What  is  the 
justification  for  the  interest  in  all  these  approaches  and,  as  we  advocate, 
even  one  more?  To  date  there  does  not  exist  an  algorithmic  method  for 
determining  whether  a  given  equation  is  of  the  soliton  type;  rather  all  of  the 
above  provide  methods  for  constructing  soliton  equations.  In  the  case  of  the 
Method  of  Prolongation  Structures  it  also  provides  a  way  for  searching  for 
structures  such  as  symmetry.  Lax  pairs,  Backlund  transformaions  for  a  given 
equation,  if  they  exist.  We  have  approached  this  general  area  with  two  broad 
goals  in  mind.  One  is  to  realize  the  implications  of  the  F-P-S  Poincare 
linearization  program  cited  at  the  end  of  §11.  This  includes,  of  course, 
solving  the  evolution  equation  in  question.  The  second  goal  is  to  use  this 
program  as  a  tool  to  try  to  determine  algorithmically  whether  a  given  equation 
is  a  soliton  equation  or  not.  In  §III  we  describe  our  results  to  date  which 
show  some  progress  towards  these  goals. 
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II-  F-P-S  Poincare-like  linearization.  The  program  of  the  classifi” 
cation  of  nonlinear  representations  of  Lie  groups  and  Lie  algebras  of  Plato- 
Plnczon-Simon  [5,17]  can  be  regarded  as  a  generalization  of  Poincare's  program 
of  transforming  vector  fields  to  normal  forms.  Here  we  shall  briefly  describe 
this  transformation  for  the  case  when  there  exists  an  equivalent  linear  normal 
form  in  the  neighborhood  of  the  origin.  Further,  we  shall  confine  our 
discussion  to  the  following  formulation  which  is  sufficient  for  introducing 
the  general  ideas.  Consider  a  nonlinear  analytic  vector  field  T  with  Taylor 
expansion  about  the  origin  of  the  form  T  =  J  where  is  the  n^^  order 

term,  which 

n^l 

describes  a  nonlinear  ordinary  differential  equation  (ODE)  of  the  form 
du/dt  -  T(u)  ,  t  G  R,  u  ^  .  We  look  for  ways  to  transform  this 

vector  field  (this  ODE)  to  a  simpler  form.  That  is  the  general  problem.  The 
problem  we  are  interested  in  here  is  to  determine  whether  or  not  this  vector 
field  (ODE)  can  be  transformed  to  its  linear  part  through  the  use  of  power 
series  substitutions-  Explicitly,  we  seek  to  determine  whether  or  not  there 
exists  an  analytic  map  A  ,  acting  on  some  neighborhood  of  the  origin,  such  that 


DA-tT]  =  o  A  ,  (II. 1) 

where  DA[T]  Is  the  Frechet  derivative  of  A  along  the  direction  T  and  o 

denotes  composition  of  maps.  Such  a  map  A:  u  — ^  v  takes  solutions  of 

du/dt  -  T(u)  into  solutions  of  dv/dt  -  T^(v)  -  In  the  Taylor  expansions  T^ 

and  A^  can  be  chosen  to  be  symmetric  n-linear  maps.  Therefore  substitution 
of  the  formal  power  series  representations  for  T  and  A  into  (II. 1)  yields 
the  following  equations 

[tI.A"]*  =  I  AP(Iq  0  T^-P+l  0  lp.q.i)(Jn.  (II-2) 

l<p<n-l , 0<q<p-l 

where  [T^.A"]*  =  Dt1*[A«]  -  DAn[Tl]  ,  Iq  =  q-fold  tensor  product  of  the 
identity  operator,  and  is  the  symmetrization  operator  on  the  n-fold 

tensor  product.  (The  tensor  product  appears  because  it  is  possible  and  useful 
to  consider  linear  maps  rather  than  n-linear  maps  at  each  order) ,  Such  a  map 
is  called  a  linearization  map. 

An  analogous  discussion  for  a  map  B  which  takes  T^  into  T  under 
the  same  assumptions  yields  the  following  equations 

=  X  TP  o  0  ...  0  B^P)  .  (II. 3) 

llp^n-lji^^t.  .  .+ip-n-l 

The  map  B  is  called  an  inverse  linearization  map  and  it  is  this  map  which 
underlies  the  usual  perturbation  methods.  In  fact,  it  is  somewhat  surprising 
that  this  approach  of  Poincare  to  perturbation  computations  is  not  more 
commonly  used.  This  is  what  is  termed  Poincare  linearization  and  we  now 
proceed  to  set  it  in  a  group  theoretical  context  and  then  describe  its 
generalization  a  la  Flato-Pinczon-Simon  to  what  is  termed  F-P-S  Poincare-like 
linearization. 

Equation  (II. 2)  can  be  given  the  following  group  theoretical  inter^ 
pretation.  Consider  the  local  group  action  obtained  by  integrating  the  vector 

field  T(T^)  to  where  t  is  the  group  parameter.  Then  (II-2) 

is  the  Lie  algebraic  version  of  the  problem  of  the  equivalence  of  the 
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one  parameter  group  U^-  to  the  linear  group  U  ,  i,e,  does  there  exist 

1  ^ 

an  analytic  map  A  intertwining  U^.  and  , 

A  o  Ut  =  U^o  A  .  (II. 4) 

^  t 

We  remind  the  reader  that  U^-  and  A  appearing  in  equation  (11*4)  are 
nonlinear  maps* 

It  is  this  part  of  the  program  Poincare  linearization  which  Flato- 
Pinczon-Simon  generalized  to  classify  the  actions  of  nonlinear  representations 
of  Lie  groups  and  Lie  algebras  on,  in  general,  inf inite-dimensional  spaces. 

In  what  is  termed  F-P^S  Poincare-like  linearization  a  given  nonlinear 
evolution  equation  on  a  given  space  of  initial  conditions  defines  a 
one-parameter  time-translation  subgroup  of  a  (larger)  covariance  group  (such 
as  the  Galilean  group  or  Poincare  group)  of  the  equation  in  question.  A 
choice  of  a  covariance  (symmetry)  group  is  naturally  suggested  by  the  physical 
context  in  which  the  equation  arises.  The  existence  of  an  explicit  invertible 
analytic  linearization  map  A  from  a  space  of  initial  conditions  for  the 
nonlinear  equation  to  one  for  the  linear  equation  which  takes  the  nonlinear 
representation  of  the  covariance  algebra  into  its  linear  part  is  Investigated 
by  analyzing  the  linearization  algorithm  (11.2)  and  the  inverse  linearization 
algorithm  (II. 3).  This  turns  out  to  be  a  problem  of  cohomology,  the  details 
of  which  we  omit  here.  We  shall  illustrate  F-P-S  Poincare-like  linearization 
with  an  application  to  the  B*0.  equation. 

The  Benjamin-Ono  (B.O.)  equation  can  be  written  in  the  form 

du(t)/dt  =  T^(u(t)),  u(t)  €  E,t  e  I  ,  (II-5) 

where  Tq  is  one  generator  of  the  nonlinear  representation  3  (a,b) 

hTq  +  bTj^  of  the  commutative  space-time  Lie  algebra  .  Here  for 

w  6  E:  To  =  +  tJ ,  =  ij,  tJ(w)  =  -a2Hw,  Tq(w)  =  -3(w)2  , 

T^(w)  =  9w  ,  (9w)(x)  =  dw(x)/dx  and  (Hw)(x)  =  P.V.  /dyw(y)  (y-x)"l7r“l 

is  the  Hilbert  transform*  E  stands  for  various  spaces,  which  in  our  work  are 
all  subspaces  of  Sobolev  spaces  W^’^  ,  the  space  of  functions  which  are  in 
together  with  all  their  derivatives,  up  to  order  n  .  Its  linear 
part  is  given  by 

dv(t)/dt  =  Tj(v(t)),  v(t)  G  E,  t  G  1  .  (II. 6) 

Suppose  for  a  moment  that  the  evolution  operator  (resp.  U^)  associated 
with  the  Equation  (II. 5)  (resp.  (II. 6))  exists,  for  each  time  t  ,  on  a  space 
E  of  initial  condition,  i*e.,  if  uq  (resp.  vq)  is  an  initial  condition  for 
equation  (11,5)  (resp.  (II-6))  at  t  =  0  ,  then  the  solution  of  Equation 
(II. 5)  (resp.  (11,6))  Is 

u(t)  =  (resp.  v(t)  -  ^^(Vq))  .  (II. 7) 

Equation  (11,7)  is  the  one-parameter  time-translation  group,  with  group  para¬ 
meter  t  ,  defined  by  the  B.O.  (linear  B.O.)  equation.  It  is  sufficient  for  our 
purpose  to  consider  this  group  as  a  one-parameter  subgroup  of  a  larger  covar- 
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[U 


"  [U^(uQ)](x+a)  and  (Vq) ] (x)  =  [U^(Vq) ] (x+a) 


The  tnap,  from  R*-  into  functions  on 

(t,a)  - >  U 


(t,a) 


(reap. 


uL  0 

(t.a) 


(II. 8) 


defines  a  nonlinear  (reap,  linear)  representation  of  the  commutative  space- 
time  covariance  (symmetry)  group  on  E  ,  i.e.. 


"(t.a)  °  “<f  ,a')  “  ''(t.t'.a.a') 


^(t,a)  °  *^(t’,a')  ^(t+t' ,a+a' 


The  solution  of  (11,2)  and  (II ,3)  for  the  generators  Tq  and  7^  of  the 
group  subject  to  the  condition  B  o  A  -  idg  ,  yields  an  invertible 

linearization  map  A  with  B  =  A“^  which  linearizes  (II:8). 

The  implications  of  the  existence  of  a  solution  B  of  Equations  (II.  3) 
are  several.  This  solves  one  of  the  most  fundamental  problems  about  an 
evolution  equation,  namely,  the  initial  value  problem  for  the  nonlinear 
equation  on  the  set  of  initial  conditions  {uq  G  E | uq  =  B(vq)}  for  some 
VQ  ^  E  by 


u(t)  =  B(v(t)),  UQ  =  B(vo)  (11. 9) 

where  v(t)  is  the  solution  of  the  linear  part  with  initial  condition  vq  . 
Also  other  questions  for  the  nonlinear  evolution  equation,  such  as  the 
existence  of  superposition  principles,  hierarchies  of  ^higher  order ^  evolution 
equations  (or  symmetries),  and  infinite  sequence  of  constants  of  motion  are 
reduced  to  the  corresponding  questions  for  the  linear  equation  when  A  and  B 
exist-  We  want  to  stress  that  we  have  not,  in  this  heuristic  discussion, 
considered  the  mathematically  important  and  technical  question  of  domains  for 
A  and  B  -  Some  of  these  points,  as  well  as  the  B-0.  equation  itself,  are 
discussed  in  more  detail  in  §III  and  in  the  papers  referenced  there. 

Ill-  Description  of  results.  It  was  known,  since  the  discovery  of  the 
1ST  scheme  for  the  KdV  equation  [8],  how  to  construct  solutions  of  the  KdV 
equation  from  a  certain  class  of  solutions  of  the  linear  part.  However,  it 
was  not  clear  how  the  Cauchy  problem  for  the  KdV  equation  on  a  given  space  of 
initial  conditions  reduces  to  one  for  its  linear  part.  In  [l]  it  was  first 
shown  how  this  reduction  takes  place  by  showing  that  on  the  space  of  initial 
conditions 


S^^(£)  *  {f  e  c“(E)|llfl|  =  sup  |(l+|x|)“a^f(x)|  <  ®,  N  »  0,1,...} 

xG]-<o,n] 

0^k<N 

the  Cauchy  problem  for  the  KdV  equation  can  be  solved  entirely  by  the  F-P-S 
Poincare-like  linearization  program  described  in  §11.  Specifically,  on  the 
space  of  initial  data  ,  a  translation-invariant  invertible  analytic  linear¬ 
ization  map  A  :  ^  A[S|5]  C  was  concretely  realized,  (Technically  in 

this  case  A  is  unique  and  converges  to  an  entire  analytic  function  on  , 

and  is  invertible,)  The  problem  of  the  existence  of  global  (in  time) 
solutions  and  the  existence  of  solutions  which  are  not  global  (in  time)  was 
solved  for  the  space  *  In  particular,  as  was  earlier  established  in  [18], 

the  existence  of  global  solutions  for  the  KdV  equation  for  all  initial 
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conditions  in  S(]t)  ,  the  Schwartz  space  of  rapidly  decreasing  function  on 
the  real  line  E  ,  was  established-  Here  the  fact  that  S(E)  C  S^(E) 
was  used,  as  well  as  a  separate  fact  [18],  namely,  A[S(E)]  is  invariant 
under  the  linear  evolution-  A  nonlinear  superposition  principle  for  the  KdV 
equation  was  also  obtained  from  the  result  that  A[Sb]  is  convex. 

This  program  was  next  applied  to  Burgers  equation.  Although  Burgers 
equation  is  not  a  soliton  equation  it  shares  many  properties  with  soliton 
equations  and  is  an  important  nonsoliton  equation.  Because  of  the  existence 
of  the  Cole-Hopf  transformation,  it  was  well  known  that  Burgers  equation 
linearizes  to  its  linear  part.  However,  the  application  of  the  F-P-S 
Poincare-like  linearization  program  to  Burgers  equation  led  to  new  results 
concerning  its  constants  of  motion  and  Hamiltonian  structure.  For 
completeness  we  shall  discuss  the  nature  of  the  linearization  map  obtained  in 
[19]  as  well  as  the  other  results . 

In  [19]  Burgers  equation  was  linearized  on  the  space  of  initial  condition 
S(E)  .  Specif ically ,  a  translation-invariant  invertible  analytic 
linearization  map  A:S  ^  A[S]  C  S  was  concretely  realized.  (Technically  A 
converges  to  an  entire  analytic  function  on  S  .  The  Inverse  A“^:A[S]  ^  S  is 
analytic  on  the  image  A[S])  .  A“^  is  close  to  the  Cole-Hopt-transformation. 

It  was  shown  that  A[S]  is  invariant  under  the  linear  evolution,  hence  the 
existence  of  global  solutions  for  the  Burgers  equation  for  all  initial 
conditions  in  S(E)  (or  L^(E))  was  established.  Further,  it  was 
established  that  Burgers  equation  has  an  infinity  of  constants  of  motion. 

This  fact  was  not  commonly  thought  to  be  true.  It  was  also  shown  that  A[S] 
is  a  convex  subset  of  S  ,  hence  the  existance  of  a  nonlinear  superposition 
principle  for  Burgers  equation  was  established-  It  was  shown  that  the 
commutative  Lie  algebra  of  higher-order  Burgers  equations  is  the  image  of  the 
positive  powers  of  the  translation  operator  under  A"*^  .  Thus  the  enveloping 
algebra  of  the  translation  operator  for  the  linear  part  is  the  source  of  the 
Burgers  hierarchy.  It  was  also  shown  that  while  Burgers*  equation  is 
dissipative  it  can  be  defined  by  a  completely  integrable  Hamiltonian  system. 

The  above  result  on  the  connection  between  the  hierarchy  of  higher-order 
Burgers  equations  and  the  powers  of  the  translation  operator  generalizes  and 
the  analogous  result  for  the  KdV  equation  was  established  earlier  in  [18]  by 
analyzing  the  direct  and  inverse  scattering  maps  in  the  1ST  scheme.  In  this 
latter  work  the  question  of  determining  a  complete  set  of  constants  of  motion 
for  the  KdV  was  solved.  Specifically,  it  was  shown  that  the  Lie  algebra 
spanned  by  certain  polynomials  in  the  generators  of  a  solvable  Lie  algebra  of 
dimension  three  for  the  linear  part  underlies  each  constant  of  motion  for  the 
KdV  equation. 

In  [3]  it  was  shown,  by  using  examples,  that  the  1ST  scheme,  Hirota  Xjg 
formalism,  and  the  Kac-Moody  constructions  of  the  Kyoto  School  all  yield  the 
same  inverse  linearization  operator  in  the  sense  described  in  §11.  Hence, 
they  solve  the  Cauchy  problem  for  the  same  set  of  initial  conditions. 
Specifically,  in  the  case  of  a  nonlinear  evolution  equation  solvable  by  the 
1ST  scheme,  iteration  of  the  associated  Gelfand-Levitan-Marchencko  (G-L-M) 
equation  yields  the  Taylor  series  representation  of  an  inverse  linearization 
map.  For  example,  following  the  approach  in  [l],  formulas  (3.11),  (4.02), 
and  (4.14)  of  Rosales  [20]  can  be  given  a  rigorous  mathematical  meaning  and 
hence  represent  analytic  solutions  to  the  cohomological  equations  for  A^^ ; 
for  the  KdV,  MKdV  sine-Gordon,  and  nonlinear  Schrddlnger  equations, 
respectively.  The  Kadomstev-Petviashvili  (K-P)  equation  turns  out  to  be 
another  example.  It  was  shown  in  the  case  of  Hirota ’s  formalism  (or 

dependent  variable  substitution  method)  that  it  contains  the  essential 


601 


Information  for  the  construction  of  inverse  linearization  maps.  This  is 
Illustrated  for  the  K-P  equation.  To  go  from  the  N-soliton  result  to  , 

one  simply  writes  the  N-soliton  solution  in  terms  of  N  solutions  of  the 
linear  part  of  the  equation.  Then  a  generalization  of  this  yields  for 

more  general  initial  conditions-  Since  the  Kac-Moody  construction  of  the 
Kyoto  School  in  general  subsumes  the  results  obtained  via  Hirota's  Xjj 
formalism  a  similar  construction  yields  A""^  for  the  whole  hierarchy.  This  is 
illustrated  for  a  sine^Gordon  hierarchy.  (The  latter  example  technically  uses 
a  generalization  of  the  Kyoto  School  construction.  This  generalization  is 
under  separate  investigation.)  Given  the  central  role  the  K-P  hierarchy,  its 
variations,  and  reductions,  play  in  the  extensive  results  of  the  Kyoto  School, 
the  class  of  soliton  equations  which  are  F“P-S  Poincare-like  linearizable  is 
quite  large. 

In  [2]  the  cohomological  equations  for  an  Inverse  linearization  map  for 
the  Benjamin-Ono  equation  were  explicitly  solved.  A  domain  of  convergence  for 
this  B  (B^,  in  the  language  of  [4]),  which  is  invariant  under  the  linear 
evolution,  was  identified.  The  resulting  global  solitonless  solutions  for  the 
B.O.  equation  are  for  each  time  t  decreasing  at  least  as  x”^  but  not 
faster  than  in  the  space  variable  x  for  nontrivial  initial  data. 

Beyond  its  physical  significance,  this  soliton  equation  was  and  still  is 
extremely  important  in  its  own  right  because  it  has  a  dispersion  relation 
(o(p)  =  IpIp,  p  G  £  for  its  linear  part  which  is  not  analytic.  (This 
dispersion  relation  has  a  certain  similarity  to  the  dispersion  relation  for 
the  linear  wave  equation.) 

The  above  analysis  of  the  B.O.  equation  was  continued  in  [4].  There,  we 
extended  the  map  found  in  [2]  to  include  solitons.  Specifically,  we  have 

explicitly  solved  the  Cauchy  problem  for  certain  initial  data  close  to  pure 
n-soliton  data  (Th  2.3,  3.1,  [4]).  This  last  result  followed  from  an  analysis 
of  the  cohomological  equations  for  the  linearization  maps  A  -  These  latter 
equations  were  explicitly  solved  for  a  space-time  covariant  map  A^  .  The  map 
Aq  was  shown  to  annihilate  pure  multi-solitons .  Therefore  in  order  to  treat 
initial  conditions  which  include  multi-soliton  components,  an  analysis  of  an 
associated  linear  problem  which  was  constructed  from  a  special  property  of  A^, 
which  we  loosely  call  its  recursivity,  led  to  the  deformation  of  the  map  A^. 
into  an  analytic  linearization  map  A  which  exists  on  a  well-defined  space  of 
initial  conditions  (Th,  2.3  [4])  and  is  one-to-one  on  pure  multi-solitons.  The 
main  technical  problem  remaining  for  solving  explicitly  the  Cauchy  problem  for 
all  initial  conditions  in  any  one  of  the  spaces  appearing  in  our  analysis,  is 
to  characterize  the  elements  in  the  image  of  A  -  A  comparison  of  these 
results  with  the  work  of  Fokas  and  Ablowitz  [21]  was  made  in  [4],  Further 
comparison  is  actively  being  pursued  by  one  of  the  authors  (RLA)  with  the 
authors  of  [ 21 ] - 

An  interesting  result  concerns  the  property  we  referred  to  in  the 
preceding  paragraph  as  the  recursivity  of  the  linearization  (inverse 
linearization)  maps.  While  we  do  not  have  a  precise  definition  of  what  we 
mean  by  recursivity,  computationally  we  identify  this  property  in  a  given  case 
by  the  simplification  of  the  division  problem  which  appears  in  the  solution  of 
the  cohomological  equations-  While  in  all  cases  A^  is  given  in  terms  of  all 
the  AJ  ,  j  <  n  ,  and  similarly  for  it  happens  in  these  cases  that  they 

are  given  by  iterating  a  few  elementary  operations.  The  recursivity  of  leads 

naturally  in  the  case  of  the  B-0.  equation  to  the  identification  of  the 
Fourier  transform  of  A^,  with  the  distorted  Fourier  transform  [22]  associated 
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with  a  self  adjoint  operator  which  is  the  x-member  of  a  Lax  pair.  This  leads 
to  a  Lax  pair  which  in  the  case  of  the  positive  frequency  part  is  a  particular 
case  of  the  Lax^pair  of  Bock  and  Kruskal  [23]*  (This  also  leads  directly  to 
an  explicit  construction  of  an  infinite  hierarchy  of  constants  of  motion.) 

The  recursivity  of  leads  to  a  Gelfand-Levitan-like  equation. 

With  respect  to  the  latter  work  on  the  B.O*  equation,  it  is  perhaps 
worthwhile  to  end  by  pointing  out  that  the  scattering  operator  for  the  Lax 
pair  we  constructed  is  a  constant  of  motion,  therefore  our  solution  for  the 
B.O*  equation  is  a  non-IST  one.  Further,  the  perturbation  approach  employed 
by  Rosales  [20]  contains  an  ansatz  which  is  implicitly  a  particular  case  of 
the  algebraic  part  (i.e.  no  topological  considerations)  of  space  translation 
invariant  formal  F-P-S  Poincare-like  inverse  linearization.  In  light  of  our 
result  for  the  B.O.  equation,  namely  A  and  B  are  not  space-translationally 
invariant.  His  approach  will  not  yield  without  conceptual  modification  the 
solitonic  part  of  B  for  the  B.O.  equation.  Another  much  simpler  example  of 
a  formally  linearizable  equation  (with  a  covariance  group  reduced  to  the 
time-translations)  ,  which  is  not  treatable  by  Rosales*  approach,  is 


af  u(t,x) 


+  (u(t,x))  .  In  this  context,  we  remind  the  reader  that 


the  general  F-P-S  problem  is  the  classification  of  nonlinear  representations 
of  Lie  groups  into  normal  forms  where  linearizable  representations  belong  to 
the  trivial  equivalence  class.  The  F-P^S  Poincare-like  linearizability  of  a 
given  evolution  equations  therefore  depends  critically  on  the  covariance  group 
considered  as  illustrated  by  our  last  example. 
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ABSTRACT ■  A  gas  dynamic  model  of  the  pellet  fusion  pro¬ 
cess  having  a  time-invariant  source  term  is  studied.  Under 
appropriate  assximptions  a  nonlinear  system  of  three  partial 
differential  equations  results.  A  group  analysis  is  per¬ 
formed  on  these  equations  and  the  family  of  one  parameter 
transformation  groups,  which  leave  the  equations  invariant, 
is  derived.  Exploiting  these  invariants,  for  some  particular 
values  of  the  parameters  involved  in  the  equations,  closed 
form  solutions  are  found. 

I .  INTRODUCTION .  In  the  pellet  fusion  process,  a 
spherical  pellet  is  fired  into  a  containment  chamber  and  then 
bombarded  with  pulses  of  laser  energy.  When  hit  by  a  laser 
pulse,  a  shock  wave  propagates  through  the  pellet  as  the 
temperature  rises.  [A  typical  pellet  configuration  is  sketched 
below.  ] 


Fig.  1.  A  Typical  Pellet  Configuration. 

The  outermost  layer  of  the  pellet,  consisting  of  lead, 
prevents  the  outward  expansion  of  the  inner  layers;  hence, 
as  the  Lithium- Lead  layer  heats  up,  it  expands  inward  forcing 
the  DT  fuel  toward  the  center. 
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The  shock  wave,  after  passing  through  to  the  inner  layer, 
is  reflected.  On  striking  the  outer  surface  of  the  pellet, 
another  pulse  arrives  and  constructively  adds  to  the  shock 
wave  already  in  the  pellet. 

The  process  continues  until  the  concentration  of  fuel 
and  the  temperature,  at  the  center,  is  high  enough  for  the 
fusion  reaction  to  take  place . 

II.  THE  MODEL.  By  applying  the  principles  of  gas 
dynamics  and  exploiting  the  spherical  symmetry  of  the  problem, 
we  obtain  the  following  equations  describing  the  process; 


H  ^  +  P  I?  ^  “  I?  =  0 

(Conservation  of  Mass) 


(1) 


+  u  |a  +  RTp-1  II  .  R  II  = 
(Conservation  of  Momentum) 


(2) 


9T 

+  u 

9t 

where  p(r,t) 

is 

u(r, t) 

is 

T(r,t) 

is 

r 

is 

t 

is 

R 

is 

Y 

is 

n 

r 

is 

3T 

3r 


,-i  .  (y-i)t|H 


=  r 


n 


(Conservation  of  Energy) 


the  radial  variable  (ym) , 
the  time  (ns)  , 


(um/ns)  , 


is  the  external  energy  input  (KeV/ns) 
The  boundary/initial  conditions  are 


(3) 


p(r,0)  =  Pg(r),  u(r,0)  =  Uq  (r)  ,  T(r,0)  =  TQ(r) 

and  u(0,t)  =  0. 

In  modeling  the  pellet  fusion  process,  we  have  implicitly 
assumed 

(a)  the  process  is  adiabatic 
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(b)  the  internal  viscosity  of  the  pellet  is  negligible. 

III.  THE  FULL  GROUP  (Boisvert  [ 3  ] ,  Ervin  [ 4  ] ) .  The 
family  of  one  parameter  groups  which  leave  equations  (l)-(3) 
constant  conformally  invariant  may  be  summarized  as: 


Theorem:  The  following  three  continuous  one  parameter 

transformations  leave  (  )-(  )  invariant. 


Q^:  (r,  t,  p,u,T)  (re 


2-n 
3 (n+1) 


2n-l 
3 (n+1) 


Q^:  (r,t,p,u,T)  (r,t+e,  p,u,T) 

Q^:  (r,t,p,u,T)  (r, t,  pe^,u,T)  . 


Proof:  follows  by  direct  substitution. 

Note:  Q2  corresponds  to  the  equations  being  translation 

invariant  with  respect  to  time.  and  indicates  that  the 

equations  are  invariant  under  a  two  parameter  dilatation  (or 
stretching)  transformation. 

IV.  APPLICATION  OF  THE  DILATATION  GROUP  (Ames  [  1  ] , 
Bluman  and  Cole  [ 2  ] ,  Ovsiannikov  [5  T)^  Assume 


- 

p  -  e  p. 


u  =  e^u,  T  =  e'^T,  r  =  e^r,  t  =  e^t. 


The  invariants  of  the  group  are 


t 

.A/e  ' 


f  (n) 


P  . 

a/ e  ' 


g(ri) 


.3/e  ' 


h(ri) 


e  /  0 

The  invariance  of  the  P.D.E.'s  (I  )-(3  )  imply  that 

3  _  n+1 
e  “  3 


(n+1) 


Following  from  the  transformation  described  by  (5)  and  subject 
to  the  condition  (6)-(8),  equations  (1)“(3)  become 

f  +  (2  +  |  +  ^)fg  -  ri(fg)  '  =  0  (9) 
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(10) 


g'  -  I 


Ti(gg'  +  R^h  +  Rh')  +  |.g2+  (^  +  l)Rh  =  0 


h'  '  ^  ri(gh'  +  (y-Dg'h)  +  (|+2(y-1)  +|-  (Y-l))gh  =  Bq,  (11) 

where  f  =  f(r|),  g  =  g(n),  h  =  h(n). 

Now,  let  us  investigate  the  boundary/initial  conditions 
under  the  dilatation  transformation 

(i)  p (r, 0)  =  Pq  (r) 

From  (5)  p(r,t)  =  r“'^^f(n),  so  that  p(r,0)  = 

Hence  assuming  that  pQ(r)  =  Ar^  gives  a/e  =  a  and 

f(0)  =  A. 

(ii)  u(r,0)  =  0 

Since  u(r,t)  =  r^'^^g(n)  it  follows  that  u(r,0)  =  r®^^g(0), 
whereupon  g(0)  =  0. 

(iii)  T(r, 0)  =  Tq (r) 

Since  T(r,t)  =  r'^^^h (n )  then  T(r,0)  =  r^^^ (0)  . 

Hence  with  T^Cr)  =  it  follows  h(0)  =  B. 

(iv)  u(0,t)  =  0 

Recall  u{r,t)  =  r^'^^g{n)  =  r^'^^g( -  ^y-)  = 
r*n+l)/3g(tr*n-2)/3) _  whereupon 

lim  r'''^^)/3g(t,r<"-2)/3,  =  o 


provided  n  ^  2. 

V.  CONSTRUCTION  OF  EXACT  SOLUTIONS.  Consider  the  case 
when  n  =  2.  Now  n  =  2  implies  3/e  =  1;  6/e  =  2;  A/e  =  0. 
Equations  (9)- (11)  then  simplify  to 

f '  +  (3  +|)fg  =  0  (i: 

g'  +  +  (2  +~)Rh  =  0  (1' 


h'  +  (3Y-l)gh  =  3, 


with  initial  conditions  f(0)  =  A,  g(0)  =  0,  h(0)  = 
From  (12)  it  follows  that 


f(n)  =  A  exp[-(3  +  -)  /  g(C)d5]. 

^  0 
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From  (14) 


h 

whereupon 

h' 


Equation  (13)  gives 


q'  + 

(2  +  f)R 

g"  +  2gg' 

(2  +  -)R 

e 


g'.(0)  =  -(2  +  f-)RB. 

Substituting  (15)  and  (16)  into  (14)  yields 

g"  +  (3y  +  Dg'g'  +  (3y  -  Dg^  =  -6q(2+|)R 
with  initial  condition 

g(0)  =  0,  g'  (0)  =  -(2  + 

The  change  of  variable 


s'  =  ygs, 


gives 

2 

s"  =  )js(g'  +  yg  ) 


(15) 

(16) 


(17) 

(18) 

(19) 

(20) 


and 

s'"  =  ys(g"  +  Syg'g  +  y^g^)  .  (21) 

In  particular,  for  y  =  5/3  (corresponding  to  a  mono- 
atomic  gas)  and  y  =  2,  equation  (17)  becomes 

s'"  +  23q(2  +  |-)Rs  =  0  (22) 

with  initial  conditions 

s(0)  =  1,  s’  (0)  =  0,  s"(0)  =  -2(2  +  |)RB.  (23) 

The  solution  of  (22)  and  (23)  is 

F 

s(n)  -  +  e  (c^  cos  Fn  +  c^  sin  Fn)  , 

where 

F  =  (2Bq(2  +  |)R)^/^ 
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and  are  given  by 


'  1 

-1 

1' 

1 

^2 

1 

3 

2 

1 

-1 

0 

.^3. 

_  0 

/3 

-  J-  p 

3,,  ^ 

From  equation  (19) 


and 


s  (n)  =  exp  [2  /  g(^)  d^] 
0 


g  =  s' /2s 


-p  sin(-~  Fn  +  (|)  +  y) 

1 


F 

-Fn  ^  /3 

-  c^e  sin(-^  Fn  +  4)) 


where 


22  -1 
c-  +  c-  and  (p  =  Tan  — 


For  Y  -  2/3,  we  choose  y  =  1  in  equation  (19)-(21).  Equation 
(17)  then  becomes 


s'"  +  6q(2  +  f)Rs  =  0, 


with  initial  conditions 


s(0)  =  1,  s'(0)  =  0,  s"(0)  =  -(2  +  f)RB. 


Letting 


G  =  (Sq(2  +  ,  G  /  0, 


the  solution  for  s<in)  is 

— Gn  T  ^  y3  /7 

s(n)  =  c^e  '  +  e  (C2  cos  Gn  +  C3  sin  Gn) , 


the  constants 

°1 

'^2 

,03  are 

given 

by 

■^l' 

■  1 

-1 

1 

1 

^2 

= 

1 

3 

2 

1 

-1 

0 

_0 

/3 

^ . 

- 

®  r 
3n 

(24) 


(25) 
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Then  g(r|)  is  given 


-  n 

c.e  sin(-^  Gn  +  (|>  +  -j) 

g(n)  =  -G - -Q - 

sin(-^  Gn  +  <p) 

where 

r~2  2”  -1  ^2 

=  \^2  ^3  ~  c~  * 

In  the  case  y  =  1/3 »  equation  (IV)  becomes 

g"  +  2g'g  =  -3o(2  +  f)R. 

Integrating,  we  obtain  the  first  order  differential  equa¬ 
tion 

g'  + 

Applying  the  initial  conditions  g(0)  =  0  and  g' (0)  = 

-(2  +  — )RB,  we  obtain  c  =  -(2  +  — )RB.  Substituting  this 

£  £ 

into  equation  (26)  results  in 

g'  +  g^  =  -3q(2  +  ^)R(n  +  • 

Upon  applying  equation  (19)  and  (20)  with  u  =  1,  equation 
(27)  becomes 


s"  +  3q(2  +  |)R(r| 
with  initial  conditions 


0, 


(26) 


(27) 


(28) 


s(0)  =  1,  s' (0)  =  0. 

g 

Making  the  change  of  variable  C  =  0  +  g—  ^  equation  (28) 
becomes  ^0 

s";  +  3q(2  +  |)RCs  =  0,  (29) 

with 

s(^)  =  1,  s'  (^)  =  0.  (30) 

^0  ^0 

Equation  (29)  is  a  Bessel  equation  of  order  1/3,  whose  solu¬ 
tion  is 

s(C)  =  5^/^[c^Jj^^3(f2(C)  )  +  )  ]  ,  (31) 

where 
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=  (6o(2  +  I  . 

Differentiating  yields 

s'(C)  =  J  [c^J^/3(fi)  + 

+  (3o(2  + 

that  is, 

s'(^)  =\  +  C2J_3^/3(J^)  ] 

+  3S2  (S2)  +  C2J_|_^^3  (S^)  ]  }  . 

For  the  application  of  initial  conditions,  it  is  convenient 
to  set 


“o  =  =  <eo<2  *  • 


Then  s(B/3q)  =  1  implies 


CiJi/3(f^o)  +  ^2'^-1/3^^0^  ” 


1/3 

and  s' (B/3q)  =  0  gives 

[CiJi/3(f^o)  +  ^2'^-1/3  ^^0^  ^ 


+  3Qq  tc^J|^3{f2Q)  +  C2‘^-1/3  ^^0^  ^ 


Using  equation  (32),  (33)  reduces  to 


^l'^i/3^^0^  ■'■  ‘^2'^-l/3  ^^0^  3fi„ 


Hence  c^^  and  C2  are  given  by 


1/3  •‘'*0’ 

j:i/3(“o» 

A  1/2 

'  B  ' 

_  -1-  r^^i/2 
3^^  ^  B  ^ 


Observe  that  det 

Wronskian  (J 


'^l/3^^0^ 


j[/3(r^o^ 


1/3 


(f2) 


■^-1/3  ^^0^ 

J_l/3(i^)) 


f2=U, 


(32) 


(33) 


(34) 


as  the  functions  Jj^^3(Q)  and  J  are  linearly  independent 
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solutions  of  (29)  then  the 


Wronskian 


0. 


Hence  Cj^  and  are  uniquely  determined  by  (34).  Since 


Wronskian  J_j^^3(fi)) 

then  from  (34)  we  have 


ttO_ 

1 

0 

/3 

“'^i/3(^o)  *^-1/3  ^^0^ 

Jn/3(%)  “'^1/3^%^  J 


Using  the  relationship 


-2  sun 

"“o 

,  V  1/2 

^  B  ^ 


3^^  '  B 


Jr-i^^^  -  Jr+i(z)  =  2J^(z) , 

s'(C)  =  j  c"^'^^{c^[j^^3(n)  +  |nj_2/3(n)  -  |nj^^3(fi)] 

+  C2[J_  1/3(0)  +  |oj_^/3(fi)  -  |oJ2/3(0)]}, 


and  therefore, 
g(n)  = 


1  /  _L  B  , -l/*^!  ^'^1/3  '*’  2  ^*^-2 

2  6,^  ^  ■Vl/3^"^  ^ 


-2/3  ~  2  *^4/3 


^2'^-l/2 


C2lJ_i/3(0)  +  ■|fiJ_4/3{^) 


}■ 


where 


0  =  (Bq(2  +  |)R)^/2  I  (n  +  . 

The  "similarity"  variable  n,  given  by 

n-2 

n  =  tr  ^ 

becomes  n  -  t  for  n  =  2. 

Hence  the  solutions  obtained  above  correspond  to  a 
'separation  of  variables  solution.' 

a 


p  (r,t)  =  r^  f  (t) 


(0)] 
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u(r,t)  =  rg(t) 

T(r,t)  =  r^h(t)  . 

The  substitution  described  by  equation  (19)  enabled  us 
to  rewrite  the  non-linear  differential  equations  (17)  and 
(18)  as  linear  equations  of  one  higher  order.  The  uniqueness 
of  the  solution  of  equations  (22) -(23),  (24) -(25),  and 
(29) -(30)  implies  a  unique  solution  of  equations  (17) -(18) 
for  the  cases  y  =  5/3,  2/3,  and  1/3. 


VI.  SINGULARITIES  OF  g(n)-  For  each  of  the  cases  dis¬ 
cussed  above  the  unknown  function  g  was  e-xpressible  as 

I 


s 

g  =  —  . 

^  ys 

Likewise  f  and  h  can  be  written  in  terms  of  s, 

A 


f  = 


(3+f) 


(35) 


and 

_  (y-1)  (s' )  ^  -  ys"s 

2,^  a,  2 
y^(2  +  |)Rs 

In  the  case  y  =  1/3,  combining  (  )  and  (  ) 

h  =  6„(n  +  i). 

From  equation  (34)  we  see  that  the  solution  to  our  system  of 
equations  (9)i-(ll)  becomes  singular  whenever  s(ri)  =  0. 
Physically,  infinite  values  of  g(ri)  are  not  possible;  however, 
the  fact  that  theoretically  it  does  become  unbounded  leads 
us  to  investigate  the  zeros  of  s(n). 


For  y  =  5/3 


y  =  2/3 


and  for 


s(n)  =  c^e”^^  -  c^e 


F 

2  /3 

sin  Fr)  +  <J)) 


F  =  (23^(2  +  ^)R) 


G 


s(ri)  =  -  c^e 

G  =  (6q(2  +  |•)R)^/^ 


2^ 


/3 

sin(-^  Gn  +  (}))  ; 
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Y  =  1/3  s(n)  =  (n  +  '‘'  '^2'^-l/3  ^  ' 

f^(n)  -  (3q(2  +  • 

In  the  first  two  cases  it  is  obvious  that  s(ri)  will  have 
infinitely  many  zeros  for  n  >  0.  Actually  as  n  becomes 
large  thfese  zeros  will  occur  almost  periodically  with 
periods  4tt//3  F  and  4tt//3  G  respectively.  For  y  =  1/3  let 
us  examine  equation  (31) 

s"(n)  +  6q(2  +  |■)R(n  +  =  0. 

Unlike  the  previous  two  cases,  in  which  the  zeros  occur  almost 
periodically,  the  zeros  of  s(ri)  (for  y  =  1/3)  occur  "more 
frequently."  To  see  this  we  apply  the  "Interlacing  of 
Zeros  Theorem"  [cf .  6] . 


Observe  that 


w(n)  =  (n  + 


is  a  solution  to  equation  (28) .  For  large  K  the  zeros  of 

J  .-(5)  are  separated  by  approximately  it.  As  ti  increases 
X/  -J 

the  amount  by  which  n  must  vary,  such  that 


((6o(2 


£)r)  V2 


(n  + 


^0 


increases  by  tt,  decreases.  Hence  the  zeros  of  w(ti)  occur 
"more  frequently"  as  n  increases.  Since  the  zeros  of  s(ri) 
and  w(n)  are  interlaced,  it  follows  that  the  zeros  of  s(n) 
must  also  occur  "more  frequently"  as  ri  increases. 
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Figure  4. 


Graph  of  h  for  n  =  2,  Y  =  5/3,  a/e  = 
6q  =  3.47,  A  =  5.10,  B  =  0.01. 
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AN  ENDOCHRONIC  APPROACH  AND  OTHER  TOPICS 
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Abstract 

(i)  A  synopsis  of  a  recently  developed  methodology  for  computational 
small-deformation  elasto-^plastic  analyses  of  cyclically  loaded  structures, 
based  on  a  variant  of  an  endochronic  theory,  is  presented.  The  present 
plasticity  model  is  shown  to  provide  a  unified  basis  for  the  earlier 
’’multi  ple-yi  eld-surf  ace"  or  "nonlinear  kinematic  hardening"  theories  of 
Mroz,  Krieg,  Dafalias  and  Popov,  Chaboche,  and  others.  (11)  Issues 
related  to  constitutive  modeling  of  finite  deformation  elasto-plastlclty, 
and  attendant  numerical  implementations,  are  discussed.  Consistent  rate 
forms  of  elastic-plastic  evolution  equations,  which,  in  the  limit,  model 
the  hyperelastic  or  small-deformation  elastic-plastic  behaviours  appro¬ 
priately,  are  presented. 

I.  An  Endochronic  Computational  Approach  to  Cyclic  Plasticity 
Without  much  loss  of  generality,  we  treat  only  deviatoric  plasticity.  Let 
0  be  the  stress,  aj  its  deviator;  de  the  strain-rate,  de’  the  deviatoric 
strain;  dCn,  the  mean  strain,  de'  =  (de®)'  ♦  deP;  the  plastic  strain-change 
deP  is  purely  deviatoric;  and  thus  the  differential  of  mean-strain,  de¬ 
noted  by  de^,,  is  purely  elastic,  i.e.,  den,  ^  consider  the 

solid  to  be  elastically  isotropic.  Using  additive  decomposition  of  de 
into  de®  and  deP,  we  have; 

do' 

de^  »  de'  -  •= —  (1.1a) 

-  -  2y 

Following  Valanis  [1],  we  define  endochronic  (internal  time)  (but 
Newtonian  time-like)  parameters:^ 

dc  -  (dcP;deP/''*  ;  dz  =  ;  f(o)  =  1  ,  d^  >  0  (1.1b,c,d) 

where  r(5)  is  monotonically  increasing. 

As  in  Valanis  [1],  the  stress  in  the  elastic-plastic  solid  is  represented 
through  the  integral: 


In  what  follows,  A*B 


and  A:B  =  A.  . 

-  Ij  ij 
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(1  .2) 


f2 

o'  =  2u  p(2  -  z*)  ~ 

j  Q 

Where  y  is  the  initial  (elastic)  shear  modulus,  and  p(z)  is  a 
material-specific  kernel.  Equation  (1.2)  thus  appears  to  circumvent  the 
need  for  a  yield  surface  as  well  as  for  the  flow  rules  of  classical  plas¬ 
ticity  theory.  Differentiation  of  (1.2)  leads  to: 


do'  =  2yp{de'  + 


h(z) 


do’ 


do' 


p(o)f(i;) 


[Us'  -  2r  ^ 


2y 


(1 .3) 


p(o)  =  P  at  Z  =  0;  y  =  y[l  +  p(o)  ;  h(z)  =.  f  (z-z' )  ~  dz 

^  '  Jo  (1.4) 


While  the  classical  loading/unloading  criteria  (or  criteria  for  elastic  or 
plastic  processes)  are  apparently  bypassed  in  Eq.  (1.3),  there  are,  nevei — 
theless,  prices  extracted  for  this  seeming  simplicity.  Some  of  these  coun¬ 
terbalancing  difficulties  of  the  above  endochronic  approach,  as  compared 
to  a  classical  plasticity  theory,  are  as  follows:  (i)  The  determination 
of  stress  history  (and  do)  for  a  given  strain  history  (or  de)  at  each 
material  point  becomes  highly  iterative  in  nature,  as  seen  from  (1.3); 
(ii)  In  a  finite  element/boundary-element/or  other  weak  solution  of  the 
boundary  value  problem,  the  trial  solution  de*  is  derived  by  differ¬ 
entiation  of  trial  displacements  du.  To  determine  the  trial  stresses  do' 
and  yet  retain  a  piecewise-llnear-equation  solution  strategy,  there  is  no 
recourse  other  than  to  approximate  Eq.  (1.3)  as  do'  =»  2ypde'.  Thus,  the 
stiffness  matrix  at  any  stage  of  loading  is  essentially  the  linear-elastic 
stiffness  matrix;  and  the  elastic-plastic  solution  method  becomes  the 
so-called  "initial-strain"  method;  (iii)  To  model  the  uniaxial 
stress-strain  curve  of  a  material  that  does  exhibit  a  sharp  'knee'  near 
the  elastic  limit,  the  kernel  p(z)  has  to  be  weakly  singular  at  z  »  0. 
These  drawbacks  notwithstanding,  Valanis  and  Fan  [2]  have  recently  pre¬ 
sented  a  series  of  papers  dealing  with  a  direct  computational  implementa¬ 
tion  of  an  iterative,  initial  strain  method  based  on  Eq.  (1.3)  and  using 
exponential  functions  for  the  kernel  p(z)  in  Eq.  (1.4).  Details  of  compu¬ 
tational  times  for  achieving  convergence  of  plasticity  iterations  of'  the 
global  finite  element  equations,  or  of  the  iterations  for  stress  integra¬ 
tion,  are  not  readily  available  in  [2]. 
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Watanabe  and  Atluri  [3-6]  have  recently  presented  alternate  characteriza¬ 
tions  of  cyclic-plasticity  constitutive  relations  using  the  essential  con¬ 
cepts  of  an  endochronic  theory,  but  with  the  following  features:  (i)  The 
notion  of  a  yield-surface,  and  the  demarcation  in  the  definitions  of  the 
elastic  processes  and  plastic  processes,  are  retained;  (ii)  The  stress 
history  (or  do),  for  a  given  strain  history  (or  de),  can  be  determined 
quits  easily,  as  in  a  classical  plasticity  theory,  by  using  a  "gener¬ 
alized-midpoint-radial-return"  algorithm;  (iii)  The  finite-element  formula¬ 
tion  can  ..be  based  .on  a  "tangent-stiff ness",  approach  ,■  wherein  the  material 
constitutive  law  at  each  point  can  be  chosen  differently  depending  on 
whether  an  elastic  process  or  a  plastic  process  is  postulated  at  each 
point  during  the  current  'load'  increment;  and  (iv)  The  present  [3~6]  endo¬ 
chronic  approach  provides  a  unified  basis  for  other  well-documented  theo¬ 
ries  to  model  cyclic-plasticity,  viz.,  the  "multiple-yield-surface"  theo¬ 
ries  of  Mroz  [7],  Krieg  [8],  and  Dafalias  and  Popov  [9];  and  the  kinematic 
hardening  theories  of  Prager,  and  Chaboche  Cl2].  The  starting  point  in  the 
work  of  Watanabe  and  Atluri  [3“^]  is  the  representation  of  the  kernel  p(z) 
in  Eq.  (1.2)  in  the  form; 

p(z)  =  6(z)  +  p^(z)  (1.5) 

where  6(z)  is  a  Dirac  function  and  pi  (z)  is  a  non-singular  function.  It 
turns  out  [3-4]  that  the  term  pQ  6(z)  in  Eq.  (1.5)  leads  to  the  notion  of 
a  yield  surface;  the  function  f(i;)  in  (1.2b)  leads  to  the  notion  of  a 
yield-surface-expansion  (isotropic  hardening);  and  the  function  pi(z)  in 
(1.5)  leads  to  the  notion  of  yield-surface  translation  (kinematic 
hardening).  Use  of  (1.5)  in  (1.2)  leads  to: 

de^  fZ  de^ 

2'  ■  »0  dF  *  dF 


dG*" 

dz 


a'  (z) 


(1 .6b) 


wherein  the  definitions  of  and  a’  (the  "back  stress")  are  apparent. 
Eq.  (1.6b)  can  be  written  as: 
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dC  >  0 


(1 .7) 


(o'  -  a’) 
de^  =  - 

T°f 

y 


; 


Of  course,  Eq.  (1.7)  is  entirely  reminiscent  of  the  classical  flow-rule 
and  normality  relation  for  plastic  strain-rate  using  a  Mlses’  yield 
criterion.  However,  at  this  point,  this  similarity  is  purely  formal. 


From  the  very  definition  of  di;  as  in  (1.1b).  it  follows  that,  during 
plastic  flow, 


^  ~  =  1  ,  i.e.,  (a'  -  ot'):(o'  -  a')  «  [t°  Tiof  (l.Sa.b) 

Equation  (1.8b)  clearly  indicates  that  during  plastic  flow,  the  stress 
point,  in  the  deviatoric  stress  space,  remains  on  a  Mises-cylinder  of 
radius  f(?),  with  the  center  of  the  surface  at  o'. 


By  differentiating  (1.7)  with  respect  to  c, 
relation  which  holds  during  plastic  flow: 

d^e^  de*^  df  do'  da'  ^ 

y 


one  obtains  the  following 


(1.9) 


From  (1.1),  and  the  definition  of  o'  as  in  (1.6),  respectively,  we  see 
that : 


do'  de'  de*^ 

d?  ”  ^d^  dc  ^ 


(1.10) 


and 

da'  de^  h 

™  =  2u  [p,(o)  ^  +  ~]  (1.11a) 


where  h  = 


z  dp, 


9e^ 


— -  (z-z' )  - 

dz  3z' 


dz' 


(1  .11b) 


Use  of  (1.10)  and  (1.11a)  in  (1.9)  results  in; 
[l  .  .  .  -O  (df/dt) 


de' 


-  P,(0)  .  T- 


2u 


]dG^ 


li*d^  T°f  d^G^ 


Also,  during  plastic  flow,  it  follows  from  (1.8a)  and  (1.7)  that: 
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(1.13) 


^2  P  .  P 
a  e  de 


d^ 


dC 


d^e^ 


) 


Taking  the  trace  of  both  sides  of  (1.12)  with  [(o'  -  o’)/(T^f)]  [or 
which  is  also  equal  to  (deP/d^)!  and  using  (I.13),  one  obtains: 

(o'  -  a')  T°(df/d^)  h*:(o'  -  a') 

de*  t  — ^ —  =[!■'■  Pi(o)  ^  5 -  ]  d!;  s  c  dc 

"  I  2\i  o„ 

V  (i-i**) 


wherein  the  definition  of  C  is  apparent. 


dc.i[ 


tde* ) : ( a’  -  a’ ) 


T°f 

y 


]  s  1  de' 


Eq.  (I.IM)  can  be  rewritten  as: 

:  N  (1.15) 


where  N  =  (o'  -  a')/-t°f  is  a  unit  "Normal". 

Mow,  by  definition,  during  a  "Plastic  Process",  i.e.  when  deP  ^  0,  we  have 
dc  >  0.  Thus,  (1.15)  clearly  indicates: 


(A)  Definition  of  a  Plastic  Process  (P);  d^  >  0 

(P)  if  (i)  (o'  -  a'):(a'  -  a')  =  ^  de' :N  >  0  (I.16) 

Equation  (1.15)  also  indicates  that  a  "plastic  process"  is  not  possible  if 
N:dc'  S  0.  In  confirmity  with  this,  we  define  an  "elastic  process"  as 
follows: 


(B)  Definition  of  an  Elastic  Process  (E):  d?  -  0 

(i)  if  (o'  -  a'):(o'  -  a')  <  (T°f)^  ;  or  (1.17a) 

(ii)  if  (o'  -  a'):(o'  -  o')  *  and  N:de'_S  0  (1.17b) 

It  is  interesting  to  observe  that  the  (Elastic)  and  (Plastic)  processes 
defined  above,  for  the  present  endochronic  theory,  depend  directly  on 
whether  (M'.de)  =  0;  while  in  the  classical  plasticity  theory  these 
processes  depend,  ab  initio,  on  whether  (M;do)  ^  0.  In  computational 
mechanics,  the  central  problem  of  plasticity  is  to  determine  do,  given  as 
de.  In  this  context,  the  (E)  and  (P)  criteria  of  (1.16)  and  (1.17)  are 
more  direct  and  more  meaningful.  Using  (1.15)  in  (1.7),  we  obtain; 


During  (P); 

de^  -  1  N(N:de')  *  ^  N(N:de)  (1.18) 

since  N  is  deviatoric.  Recall  that  o'  [see  (1.6)3  and  h*  [see  (1.11b)], 
and  through  them,  the  coefficient  C  [see  (1.14)]  depend  on  the  kernel 
Pi (z) . 


A  convenient  choice  for  the  kernel  Pi (z)  is: 
Pl(z)  =  1  P^j^  exp(-  B^z) 


(1.19) 


such  that,  from  (1.6)  it  follows  that: 
/■z 


de' 


and 


S*  =  I  2u  [  {p  exp[-  8  (z-z')]  ~  dz' 
i  ^o 

i  % 


_  r  ,(i) 

=  4  0' 


(1 .20a) 


(1 .20b) 


with  being  defined  in  an  apparent  fashion.  From  (1.20)  it  follows 

do'  =  2ii^  P^(o)  de^  -  1).  ^ -  }  (de^:d£^)'^* 


8. o' 

P  -  II  — 

i  ^ 


(1  .21  ) 


Thus,  the  evolution  equation  for  o'  is  nonlinear  in  deP  and  thus  is 
similar  to  a  nonllneai — kinematic-hardening  relation  [12].  It  has  been 
discussed  in  detail  by  Watanabe  and  Atluri  [4]  that  the  present  theory, 
with  the  translation  of  the  yield  surface  as  in  (1.20),  and  the  expansion 
of  the  yield  surface  as  specified  by: 

f  “  (1  +  Y?)  [linear]  ,  (1.22a) 

or  f  =  a  +  (1  -  a)exp(-:jjc)  [saturated]  ,  (1.22b) 

where  Y  and  t  are  constants;  and  5  =  /dc;  includes  the 
multiple-yield-surface  theories  of  Mroz  [7],  Krieg  [8],  and  Dafalias  and 
Popov  [9]  as  special  cases. 


Based  on  (1.18),  the  stress-strain  relation  in  the  present  theory  may  be 
written  as: 

do'  =  2m  [de'  -  r  1  N  (N:de')] 

O'  ^  '  “624 


(1 .23a) 


(do:l)  •»  (2ii  +•  3A)(de;I) 
where  r  =  1  In  (P)  and  P  =•  0  in  (E). 


3y  assuming  r  “  1  or  0  appropriately,  one  may  proceed  to  develop  a 
tangent-stiffness  finite-element  method  in  the  usual  fashion.  If  the 
stress  Ofj  at  state  C^,  in  an  incremental  solution,  is  known,  the 
incremental  stresses  Ao  corresponding  to  the  trial-solutions  Ac  for 
incremental  strains  are  determined  in  the  usual  fashion.  We  assume  that 
Ofi  is  on  the  yield  surface  and  further  assume  that  the  process  had  been 
plastic;  i.e.,  ||  ( 2yAe*)  -  Oj^H  >  (xy  f^)*  Then,  for  any  6 
such  that  0  <  9  <  1,  the  algorithm  for  determining  the  actual 
stress-increment  Ao  in  the  plastic  process  proceeds  as  follows: 

rj  - - — _ 

II  (c;  >  2M0Ae*  )  -  a^^ll 

^  (!i0!^'£)N9  5  (I.Zilb) 


T°(df/dc)  h*!(o'  -  s') 
(where  C  *■  [l  +  p,(o)  +  - II - H — 

y 

Ao'  -  2p[Ae'  -  i  (NQ;Ae)NQ] 
n 

Ao:I  -  (2p  +  3A)(Ae:I) 

AC  =  (Ac’’;  Ac’’)'*  ;  f^^^  -  f^  +  Af 

“n+1  “  P^(o)Ae’’  -  (I  - -  )ac} 

i  n 


(1 .24c) 
(1 .24d) 
(1 .24e) 


(1 .24f) 


Of  course,  several  variants  of  the  above  algorithm,  such  as  sub-incremen¬ 
tal  ones,  are  possible.  The  above  tangent-stiffness  finite-element,  and 
generalized  mid-point-radlal-return-stress-lntegration,  algorithm  has  been 
used  by  Watanabe  and  Atluri  [5,6]  to  solve  several  problems  of  cyclic  plas¬ 
ticity  and  non-proportional  biaxial  loading.  It  has  been  found  that  the 
present  models  capture  the  experimentally  observed  phenomena  of  cyclic 

hardening,  cross-hardening ,  ratcheting,  etc. 
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Because  of  the  superior  predictive  capabilities  of  the  present  model  and 
the  fact  that  It  is  no  more  difficult  to  Implement  than  the  usual  (classi¬ 
cal)  plasticity  models,  it  may  be  a  candidate  for  further  exploitation  in 
general  purpose  computational  programs. 

II .  Towards  a  Consistent  Finite  Deformation  Plasticity 
Theory  and  Attendant  Computations; 

In  dealing  with  finite  deformation  problems,  it  has  been  (and  to  a  large 
extent  still  is)  customary  in  the  computational  mechanics  literature  to 
postulate  constitutive  theories  in  the  form  of  linear  relations  between  an 
objective  rate  (usually  the  Jaumann  rate)  of  stress  or  an  objective  rate 
of  an  internal  variable  (such  as  the  back  stress  in  a  kinematic  hardening 
plasticity  theory)  on  the  one  hand,  and  (the  objective)  velocity  strain 
(or  the  symmetric  part  of  velocity  gradient)  on  the  other.  In  this 
section,  we  reexamine  such  approaches  and  point  out  several  alternative 
paths  towards  rational  and  consistent  theories  and  attendant  computational 
algorithms  for  finite  deformation  elasto-plasticity. 

First  we  examine  hyperelasticity,  and  then  elastoplasticity,  with  the  aim 
that  a  finite-deformation  elastoplasticity  theory  should  remain  valid  in 
the  limits  of  hyperelasticity  as  well  as  of  small-deformation  elastoplas¬ 
ticity. 

II . 1  Hyperelasticity 

We  consider  here  an  isotropic  hyperelastic  solid  wherein:  F  is  the  defor¬ 
mation  gradient  with  polar  decomposition  F  =■  B’^e  ^  Ye'S'  where  Ug  and  Vg 
are  the  elastic  stretches  (Ug  *  U  and  Vg  s  V  in  this  case);  R  the  rigid 
rotation;  Vigg  the  elastic-strain-energy  density  per  unit  initial  volume;  o 
the  Kirchhoff  stress  (=  Jt  where  J  =  det  F,  and  t  the  Cauchy  stress).  For 
observer  transformations  denoted  by  an  orthogonal  rotation  Q,  it  is  well 
known  that  the  various  quantities  transform  as:  F  5  ^  9*~’  He 

Ug;  V  and  a  ■*  Q-o-Q*-.  Based  on  these  oberver-f name  related 

transformations,  one  may  write  an  objective  constitutive  relation  for  an 
isotropic  hyperelastic  solid  in  any  number  of  alternative  form’s  as 
follows: 
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(2.1) 


0  *  R*Jr(U  )*r'^  *  F‘S(U^)*f’^  *  f’'^*C(U^)*F  ’  =  f'’^*H(IJ  ) ‘F*^  = 


=  E*c(y  )*!i^  *  Vl^U  “ 

“*  *  ~©  *  *m  ^  *^0  ^  ^0 


where  F  is  the  "rotated"  stress,  S  is  the  2nd  Piola-Kirchhoff  stress,  C 
the  "convected"  stress,  M  is  another  "induced"  convected  stress,  r  the 
"Biot-Lure-Jaumann"  stress*  The  physical  interpretations  of  these 
tensors,  as  well  as  that  of  T,  are  given  in  Atluri  [13]-  The  tensors  F, 
S,  C,  M,  and  r  are  functions  of  alone  and,  hence,  are  observer 
invariant,  while  T  is  a  function  of  V^and  thus  transforms  as  T  *  Q*I*Q^* 
From  (2.1)  it  is  easy  to  note  the  following  equalities; 


jr(U  )  =  U  •S(U^)-'J 
-  -e  -e  -e  -e 


u’^*C(U  )*u'’  =  •M(U  )*U  =  U  •r(U  ) 

-e  -  -e  -e  ~e  ~  -e  -e  ~e  ~  ~e 


(2.2) 


As  shown  in  detail  in  Atluri  [13],  for  isotropic  elastic  solids,  the 
following  relations  exist  between  the  various  stresses  and  the 
elastic-strain-energy  density  WgeJ 


and 


3W 


oe 

ainU  ’ 


~e 


3W. 


oe 


3^e 


(2.3) 


(2.4) 


where  J,n  (^)  denotes  the  natural  logarithm  of  (^).  For  initially 
unstressed  solids,  we  have  the  restrictions; 

r  =  S  =  C  =  r  =  0  when  Ug  =  I  ;  and  T  =  o  =>  0  when  =  I  .  (2.5) 


Based  on  (2.4)  and  (2.5),  one  may  define  a  resticted  class  of 
"semi-linear"  isotropic  hyperelastic  solid  through  the  relations; 


F  =  2u  InUg  +  x[[i!,nUg);l]l  (2.5a) 

S  =  2m(u^  “  i)  *■  (2.5b) 

C  =  2u(u"’  -  I  I  +  X[(U'^  -  (2*50. 

^*0  6 

r  =  2u[U  -  l)  *  x[[u  -  (2.5d) 

"*0  0 

T  »  2u(v  -  l)  +  X[(y  -  (2.5e) 

^0  ^  ^ 
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wherein  it  is  possible  that  the  coefficients  u  and  X  may  have  different 
numerical  values  in  each  of  Eqs.  (2.5a-f). 

Let  L  =  be  the  velocity  gradient,  D  =  (L)g  [=  ‘/j(L  +  be  the 
velocity  strain  [(^)3  and  ( ^)a  denote  the  symmetric  and  antisymmetric 
parts,  respectively,  of  ( J];  w  =  (L)a  [=>  VaCL  -  L^)]  be  the  spin;  and  n  = 
R'R*'.  Now,  by  differentiating  Eqs  (2.1)  with  respect  to  time,  one  easily 
obtains  a  set  of  objective  rate  relations: 


a  “  n*tJ  o*Q  =  Or,  “  R*r»R 


0  -  w*o  +  o*w  ®  0,  =  R*ll!  -  *4(y  ‘U  ’  -  y  ^-y  }*r 


'  -y  -  y  -y"*  j}*R^  =  R-{r  -  w-r  r-w)*R^ 


0  -  L*o  “  o'L**  =  0^  “  5  ^i^*2  *  2*li  ^  Sr  “  C  ^ 

a  ■*■  L^*o  -  “  E  2  '  k*2  2*5  “  2*  “  E‘C'5*^ 

XT 


i  =  ^  '  2  “  2u  ^  (invj  *  x[^  (l!.nvj:l]l 


'dt  -e- 


In  (2.6b),  W  is  the  observer-invariant  form  of  spin: 


(2.6a) 

(2.6b) 

(2.6c,d) 

(2.6e,f) 

(2,7) 


w  =  r’^'W^r  -  r'^.r  =  r'^.(w  -  2)*R 


(2.8) 


Also,  oq  is  the  Green-Naghdi  rate,  oj  the  Jaumann  rate,  ox  the  Truesdeli 
rate,  or  the  Rivlin  rate,  etc.  However,  using  (2.2)  and  (2.1),  all  the 
rate  equations  (2.6a-f)  may  be  written  in  a  unified  form  as: 


(2.10) 

where  R^'Oq'R  may  be  considered  as  the  Green-Naghdi  rate  in  the  rotated 
frame.  If  the  eigendirections  of  Ug  are  represented  by  the  matrix  a  and 

those  of  Vo  are  represented  by  S,  we  have  (see  Atluri  C13]): 
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(2.11a) 


—  "(inll  1  =  a^*  ~  (Hnxl'a  +  a'^'lnX'a  a^*ZnA*a 
^  (tnV  )  «  |r  (lnu]-B  +  6^-inu*B  +  s’^-lnu'S  (2.11b) 


where  A  and  are  diagonal  matrices  consisting  of  the  eigenvalues  of  U^and 


V  respectively,  and  4r  InA  =  A  =•  A*A  \  Thus  when  (2.11a)  and  (2.11b) 
are  used  in  (2.5a)  and  (2.5f),  respectively,  the  expressions  for  f  and  o 
become  rather  complicated.  On  the  other  hand,  when  the  elastic  stretches 
are  small ,  i.e.,  Ue  =  I  *  =  I  ^  £v  where  each  of  the 

components  (eij)u  and  (eij)v  is  <<'1,’"then 


UnV 

-e 


(2.12a,b) 


Thus,  for  moderately  small  elastic  stretches,  (2.5a),  (2.5f),  and  (2.12) 
lead  to: 


r  =  ~  ‘4(£,j-£u  "  £u*£u^  " 


^  4l£u  -  ‘4(£u-£u  ^  £u-£.j)  * 


(2.13a) 


and  a  similar  relation  for  o  with  being  replaced  by  ly.  For  arbitrary 
and  Vg,  it  appears  more  algebraically  convenient  to  work  with  relations 
of  the  type  (2.5b),  (2.5d),  and  (2.5e).  Considering,  for  instance,  (2.5d) 
and  (2.5e),  one  has  from  (2.10): 

-e'^e^  +  0^(0^:!)]  -  (2u  +  3X)U^ 

(2.1M) 

or,  alternatively, 

0  =  2M(Vg*Vg  +  ^tVgfVgtl) 


(2.15) 


Now,  consider  the  polar  decomposition,  F  =  5*Ue  ”  From  this  we 

•  -1  -  -1  t  •  -1  “1 

F-F  =  R-U  ‘U  *R  =  V  *V  +  • 

^  ^  —P  —  ~P  '-p  —0  ~0 


nave  l 


Thus,  D  =  ^ 


-*1 


“1 


i  "  a  *  a-(ae%'>a5'  ■ 

9y  defining  coordinate-invariant  ve^^^ity-strain  D,  and  spin  W,  such  that: 


.-1 


(2.16) 

(2.17) 


W  =  r''*(W  -  5)  *5  and  6  =  r''*D*R,  it  is  easy  to  derive  from  (2.16  and  2.17) 
that: 

U  ^  (d  +  w)*U  ^  U  *(5  “  w)  (2.18) 

-e  -  ~  ~e  -e  ~ 

and  V  =  (d  +  w)*v  -  V  ^  v  ‘(d  -  w)  +  (2.19) 

-e  ~  ~  ~e  ~e  ~  -e  -  -  ~  ~e 

Use  of  (2.18)  and  (2.19)  in  (2.14)  and  (2.15)  results  in  consistent  forms 
of  'objective*  rate-constitutive  relations  for  hyperelastic  solids.  The 
resultant  rate  equations  clearly  indicate  the  fallacy  in  the  current 
practice  wherein  a  finite  deformation  (even  hyper-elastic)  stress-strain 
law  is  simply  expressed  as  a  linear  relation  between  an  objective  rate  of 
stress  (usually  oj)  and  D.  For  Instance,  it  is  currently  common  practice 
to  write, 

(oj)  or  (Oq)  =  2UD  +  X(D!l)I  (2.20) 

Equations  (2.14  and  2.18)  and  (2.15  and  2.19)  clearly  show  that  (2.20)  is 
not  valid.  In  fact,  as  Is  well  known,  Eq.  (2.20)  with  oj  leads  to 
oscillatory  shear  stresses  in  a  finite  deformation  simple  shear  test  Cl  4]. 

While  the  above  hyperelastic  rate  relations  have  their  own  intrinsic 
reasons  for  being,  our  objective  here  is  to  use  them  as  guides  to 
postulate  consistent  rate-type  finite-deformation  elasto-plastlc  relations. 
This  is  pursued  next. 

II. 2  Rate-Type  Finite-Deformation  Elasto-Plastic  Stress-Strain  Relations 
The  primary  criteria  we  shall  demand  these  relations  to  satisfy  are: 
(1)  The  finite  elasto-plastlc  rate  relations  should,  in  the  limit,  be 
valid  in  the  cases  of  hyperelasticity  as  well  as  small-deformation 
e lasto-plast I c ity ;  (2)  They  should  obey  the  so-called  Prager's  [15] 
criterion.  This  criterion  implies  that  when  the  objective  stress  rate 
vanishes  [by  virtue  of  the ' vanishing  of  the  right-hand  side  of  the 
equation  with  the  said  stress  rate  on  the  left-hand  side],  then  the  second 
Invariant  of  the  Kirchhoff  stress  (0)  should  remain  constant.  Otherwise, 
in  a  theory  of  plasticity,  spurious  plastic  flow  may  result; 

(3)  The  classical  plasticity  concepts,  such  as  Drucker's  postulates 
regarding  plastic  work  and  plastic  normality,  should  hold. 


We  will  assume  that  the  solid  is  elastically  isotropic.  The  main  theme 
adopted  now  is  that:  the  stress  in  a  finitely  deformed  elasto-plastic 
solid  may  be  derived  from  an  elastic-strain-energy  density  function  W^g, 
such  that: 


9W, 


oe 


dS,nU 


-e 


_ 2£  .  ofv  }  =  —Qj. 

3V  ’  ~'-~e^  32,nV 

-e  -e 


(2.21) 


i.e.,  in  a  manner  entirely  analogous  to  Eqs.  (2.3  and  2,U).  Thus,  in  a 
finitely  deformed  elastic-plastic  solid,  the  'objective’  relations  for  the 
stress  rate  are  still  given  by  equations  of  the  type  Eqs.  (2.1^  and  2.l8) 

•  m 

or  Eqs.  (2.15  and  2.19),  provided  now  Ug  and  Vg  are  appropriately  defined 
in  terms  of  the  kinematics  of  an  elastic-plastic  deformation.  As  to  this, 
Lee  [16]  has  originally  suggested: 

F  =  F  ‘F  (2.22) 

-e  -p 

with  subscripts  e  and  p  denoting  'elastic'  and  'plastic',  respectively. 

However,  it  is  well  known  that  Fg  in  (2.22)  is  not  unique,  but  can  be 

determined  to  only  within  a  rigid  rotation,  Fardshisheh  and  Onat  [17] 

have  suggested  that  this  non-uniqueness  can  be  remedied  by  requiring  Fg  to 

be  a  pure,  symmetric  stretch  tensor,  Vg.  Thus, 

F  =  V  -F  (2.23) 

~e  -p 

By  requiring  Vg  to  be  similar  to  that  in  an  otherwise  hyperelastic  solid, 
we  see  that  under  observer  transformations,  the  following  results  hold: 

F  +  Q.p  ;  V  Q*V  ;  and  F  +  Q‘F  .  Note  that  in  (2.23),  the  rigid- 
~  '  ~e  -  ~e  '  ~p  ~  'P 

rotation  of  a  material  element  is  not  explicitly  determined.  With  the 
apparent  loss  of  some  generality,  however,  we  study  here  the  decomposi¬ 
tions: 

F  =  R-U  ‘U  ^  V  ‘V  ‘R  (2. 2^1) 

~  -e  -p  -e  -p  - 

where  R  is  the  rigid  rotation,  Ug  and  Vg  are  'elastic'  stretches,  and  Up 

and  Vp  are  the  'plastic'  stretch  tensors.  We  requre  Ug,  Up,  Vg,  and  Vp  to 

be  svmmetric.  Under  observer  transformations  it  then  follows  that:  Ug-»Ug; 
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Up-Upj  Vp^Q-Vp-Q*^;  and  also  V^=  R'Ug'R*^;  and  Vp  =■  R'^p-R'^. 

Under  the  decomposition  (2.23),  we  have; 


,-1 


-1  „-l 


L  V  'V  +  V  ‘F  ‘F  ‘V 


-e  -e 


-e  -p  'p 


•e 


(V  )  +  (V  ‘F  -F  ^  'V~'  ) 

—0  s  *^6  "^p  '^p  "^S 


w  a  (v  1  +  (v  ‘F  -f"^  •V~^  1 

'©  ^6  a  "^6  '^p  ’'P  ""6  SL 


(2,25a) 

.v-1 

-p  -e 

(2,25b,c) 


From  (2.25)  it  easily  follows  that 

•1 


V 

~e 


(D  +  W)  'V  -  V  '[f  'F"  )  s  V  *(0  -  W)  -  (F 
*•  - .  .  -e  '  ~p  'p  -e  '■  ~  -p  -p  -e 


(2.26) 


Now,  the  stress-power,  or  the  rate  of  stress-work,  in  a  finitely  deformed 
elastic-plastic  body  (per  unit  initial  volume)  is  given  by; 


W  =  o:D  s  W  +  W 
O'-  oe  op 

Thus,  if  D  is  decomposed  into  D  =  D  +  D  ,  we  will  have; 


(2.27a) 


W 


oe 


o:D 
-  -e 


o;D 

-  -p 


(2.27b,c) 


The  well-known  postulates  of  Drucker,  concerning  WQp,  may  then  be  used  to 
establish  the  normality  of  Dp  to  the  yield  surface,  expressed  as  a 
function  of  o.  This  suggests  the  definitions: 


D  = 
'P 


•F 

P  ~P 


W 

-P 


(2.28) 


where  Dp  and  are  the  "plastic  velocity-strain"  and  "plastic  spin", 
respectively.  Using  (2.28)  in  (2.26),  we  have; 


V  =  (D  +  W)*V 
-e  -  -  -e 


(D  +  W  )*V  V  *(0  -  W)  -  V  ‘(D  -  W  ) 

-,p  —p  ^  "“C  *"  "“S  '  "^P  ""P  ^ 


W  )*V  3  V  ‘(d 

-e"^  -e  '6  '•'€ 


(2.29) 

(2.30) 


The  use  of  (2.29)  in  (2.15)  will  then  result  in  an  objective, 
elastic-plastic  stress-strain  rate  relation  for  finite  deformations.  We 
now  need  evolution  equations  for  Dp  and  Wp  as  defined  in  Eq.  (2.28), 
which  may  be  written  down,  using  the  general  isotropic  function 
representation  theorems,  such  as  due  to  Wang  [18].  In  this  connection, 
the  work  of  Loret  [19]  in  representing  directly  the  quantity  (Fp*Fp^) 
is  useful  and  noteworthy.  From  this  single  representation,  Loret  [19] 
determines  both  Dp  and  Wp. 
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However,  there  appears  a  slight  Inconsistency  in  the  above  approach.  If 
the  process  is  purely  elastic  (and  thus  the  material  behaviour  approaches 
hyperelasticity)  or  when  there  is  no  plastic  process  as  determined  by  a 
flow  rule,  it  follows  that  Dp  =  0.  This  necessitates  to  be 

zero,  in  an  elastic  process.  Thus,  in  an  elastic  process,  (2.29)  reduces 
to: 

V  -  (D  +  W)*V  (2.31) 

-e  -  -  ~e 

which  does  not  agree  with  the  hyperelastic  relation,  (2.19),  i.e.,  Vg  » 

(S  ^  -  Ve*2- 


On  the  other  hand,  using  the  decomposition  (2.2*1),  we  have: 


L  - 

r.r'^  4-  R.u  -u’^-R^  +  R*U  -U 
-  ~  -  -e  -e  -  -  -e  -p  -p  -e  - 

(2.32) 

or 

D  - 

R.(U  •u'M  •r'^  +  R*(u  ‘U  ‘R^ 

-  '^-Q  -Q  's  -  -  '-e  -p  “P  *e  ■'s  ' 

(2.33) 

and 

w  » 

r-r'^  >  R‘[0  -u'M  +  R*(u  *0 

—  '^0  ~0  a  ^  ^  '*"6  '■p  ““6  a 

(2.3*0 

Equations 

(2.33)  and  (2.3*1)  may  be  rearranged  as: 

5  » 

r’^'D'R  -  (U  ‘U”’)  fu  •U 

-  -  '■'e  -e  3  '-e  ~p  “p  *e  s 

(2.35) 

w  - 

r'^‘(w  -  n)*R  -  (0  'U’M  +  (u  *0 

-  '  -  '•-e  -e  a  ^-e  -p  ~p  -e  a 

(2.36) 

Now,  the  stress-power  per  unit  Initial  volume  may  be  written  as: 

Wg  =  ojD  ^  R*C*5'^)5D  ®  Jr5(R'^*D»R)  -  Jr:D  (2.37) 

Thus,  if  D  is  decomposed  into  5  -  D  +  D  ,  we  have  W  =  Jr:D  ;  W  =»  Jr:5  . 

’  -  t-  **  -e  -p  oe  -  -e  op  -  -p 

Now,  the  yield  function  may  be  expressed  as: 

f(r,W  )  -  0  (2.33) 

-  op 


since  the  Invariants  of  Jf  are  the  same  as  those  of  o.  The 
Drucker-normality  would  then  apply  to  Dp  in  the  deviatorlc  space  of  f. 
Thus,  we  may  define: 


D 

-p 


(U  *0  ‘U"’ -u’^  ] 

'  ~e  ~D  -p  -e  '3 


(2.39) 


633 


(a.iio) 


Using  (2.39),  (2.35),  and  (2.36),  we  have; 


w)*y,  - 


W  )‘U 


e 


U 


(s  -  i)  -  u^-(5p  -  “p) 


(2.1)1  ) 


the  use  of  (2.1)0)  in  (2.14)  results  in  an  objective  elastic-plastic 
stress-strain  relation  that  is  consistant  for  finite  deformations.  We  may 
use  representation  theorems  [18]  to  write  down  a  general  expression  for 
(Up*UpM  from  which  both  Dp -and -Wp  can  be  determined  from  (2.39),  such 
that  Dp  satisfies  the  normality  condition.  We  may  essentially  follow 
Loret  [19]. 


The  development  in  (2.40),  in  conjunction  with  (2.14),  will  remain 
consistent  in  the  limit  of  hyperelasticity.  Thus,  when  Up  =  i ,  Up  ^  0, 
Dp  =  0  “  Wp,  and  thus  Ug  =  (D  +  W)»Ug,  which  agrees  with  the  hyperelastic 
relation,  (2.18). 

Also,  the  present  suggestion  of  an  elastic-plastic  constitutive  relation 
based  on  (2.40)  and  (2.14)  does  satisfy  the  Prager  condition  [15].  Thus, 
when  r''*oq*R  =  0,  it  is  clear  that  since  the  rate  of  stress  in  a  rigidly 
rotating  system  is  zero,  the  invariants  of  the  Kirchhoff  tensor  remain 
constant.  This  is  not  the  case  when  other  stress-rate  eo.uations  as  in 
(2.6c,d,e,f)  [i.e.,  rates  in  non-rigidly-spinning  systems]  are  used  and 
when  the  right-hand  sides  of  (2.6c,d,e,  and  f)  are  consistently  determined 
(through  time  differentiation)  from  (2.5b,c,d),  respectively.  If,  on  the 
other  hand,  one  uses  ad-hoc  postulations,  as  is  commonly  done  currently, 
with  only  a  linear  relation  between  an  objective  stress  rate  and  D,  such 
as: 

(o^)  or  (oj^)  =  2u[d  -  A(N:D)n]+  (2.42) 

then,  Ox  O'’  Or  =  0  implies  that  D  =  0,  and  then  L  =  W,  and  hence  ox  ^"d  or 
reduce  to  oj.  Thus,  since  oj  is  a  spin-based  rate,  when  ox  or  or  is  zero 
in  (2.42),  the  invariants  of  o  themselves  remain  constant,  and  hence 
Prager's  condition  is  obeyed  by  the  inconsistent  relation  (2.42). 
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To  close  the  description  of  other  plasticity  features,  such  as  kinematic 
hardening,  we  may  introduce  a  Kirchhoff-stress-like  internal  variable  a, 
which  we  may  label  as  the  finite  deformation  back-stress.  If 

=  a  -  ft*a  +  a*n  ,  (2.^3) 

one  may  introduce  an  isotropic  tensor  function  representation 

a„  =  f(o,a,D)  (2.^iJ) 

^ 

Such  general  representations,  and  specific  examples,  were  considered  by 
Reed  and  Atluri  [22], who  attempted,  successfully,  to  model  the  results  of 
f i ni te~ tor 3 ion  experiments.  For  instance,  Reed  and  Atluri  [21] 
demonstrated  that  an  excellent  agreement  with  the  experimental  results  of 
Swift  [22]  for  stresses  and  strains  in  a  finite-torsion  test,  can  be 
obtained  from  the  simple  model: 

=  C,  dP  >  C„  a'  (2.^15) 

where  =  constant  ;  =  C2(oi':dP)  , 

in  conjunction  with 

Oj  =  A^  -  *2  2'  (2.i*6) 


where  A^  =  constant  ;  A^  =  A^to'-.o)  . 


Another  "nonlinear-kinematic-hardening"  model,  which  is  a  special  case  of 
the  representation  in  (2.M4),  and  a  generalization  of  (1.21),  may  be 
generated  by  writing: 


. ' 


C  Dp  -  8.  a'(Dp:Dp)  * 


(2.47) 


The  primary  reason  for  using  such  models  as  in  (2.45)  and  (2.47)  is  that 
simple  linear  models  of  the  type 

a'  =  C,  dP  (2.48) 

can  easily  be  shown  to  lead  (see  Atluri  [14])  to  oscillatory  values  for 
the  components  of  a  in  cases  such  as  finite  simple  shear.  If  .this 
linearity  is  insisted  upon,  non-oscillator y  back  stress  a  may  be  generated 
by  replacing  the  left-had  side  of  (2.48)  with  other  objective  rates  such 
as  a.Q,  a-f,  or  ac,  as  shown  by  Atluri  [14],  or  by  creating  other 
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stress  rates  such  as  based  on  the  spin  of  material  fibers  aligned  with  the 

principal  directions  of  a,  as  done  by  Lee  [23].  However,  since  modeling 

material  behaviour  under  cyclic  loading  necessitates  the  use  of  nonlinear 

kinematic  hardening  models  as  discussed  in  Section  I  of  this  paper,  it  may 

be  worthwhile  to  circumvent  the  arbitrariness  surrounding  the  left-hand 
side  of  (2.i|8)  [while  retaining  only  the  linear  term  in  DP  as  in  (2.48)] 
and  concentrate  instead  on  the  right-hand  side  of  (2.48)  and  consider  more 
general  representations  involving  DP,  a,  o  as  in  the  right-hand  sides  of 
(2.44)  and  (2.47).  This  type  of  modeling  the  evolution  of  the  back  stress 
a,  along  with  the  use  of  Eqs.  (2.40)  and  (2.14)  for  representing  the  evolu¬ 
tion  of  Kirchhoff  stress  a,  may  form  a  consistent  basis  for  a  flnite- 
elasto-plastic  theory.  This  remains  to  be  verified  through  systematic 
computational  modeling  of  available  experimental  data  on  finite 
plasticity. 
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ABSTRACT .  We  discuss  the  motion  of  nonlinear ^ viscoelastic  materials  with 
fading  memory  in  one  space  dimension*  We  formulate  the  mathematical  problem, 
survey  results  for  global  existence  of  classical  solution  to  the  initial  value 
problem  if  the  data  are  sufficiently  small,  and  discuss  in  detail  the 
development  of  singularities  in  initially  smooth  solutions  for  large  data* 


1*  INTRODUCTION  AND  DISCUSSION  OF  RESULTS*  In  this  paper  we  discuss 
the  motion  of  nonlinear  viscoelastic  materials  with  fading  memory  in  one  space 
dimension*  We  concentrate  on  viscoelastic  solids  and  briefly  remark  on  simlar 
results  for  fluids*  After  formulating  the  mathematical  problems,  we  survey 
results  for  global  existence  of  classical  solutions  to  the  initial  value 
problem,  provided  the  initial  data  are  sufficiently  small.  We  then  discuss  in 
some  detail  the  development  of  singularities  in  initially  smooth  solutions  for 
large  data- 

We  consider  the  longitudinal  motion  of  a  homogeneous  one^dimensional  body 
occupying  an  interval  B  in  a  reference  configuration  and  having  unit 
reference  density.  For  simple  materials,  the  stress  o  at  a  material  point 
X  is  a  nonlinear  functional  of  the  entire  history  of  the  strain  ^  ~  ^ 
the  same  point  x  (here  u  denotes  the  displacement).  In  this  paper,  we 
confine  ourselves  to  the  following  model  problem,  which  can  be  motivated  as  a 
natural  generalization  of  Boltzmann’s  constitutive  relation  for  linear 
viscoelasticity  [1]  (the  derivation  of  similar  results  in  a  variety  of  other 
models  will  be  discussed  in  a  later  paper) 


a(x,t)  =  (p(e(x,t))  +  a*  (t-T  )iKe(x,T)  )dT  , 

(x  e  B,  -<«  <  t  <  «>)  * 

Here  9  and  ^  are  given  smooth  functions  R  ^  R  with 

(p(0)  =  T|i(0)  -  0,  9*  >  0,  >  0  , 


(1.1) 


(1.2) 
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and  for  physical  reasons  the  relaxation  function  a  ;  [0 f^)  R  is  positive, 

decreasing,  convex,  and  a'  6  I>^[0,<»),  where  •  denotes  the  derivative-  The 
conditions  on  a  imply  that  the  stress  relaxes  as  time  increases  and  that 
deformations  which  occurred  in  the  distant  past  have  less  influence  on  the 
present  stress  than  those  which  occurred  more  recently.  Since  only  a* 
occurs  in  the  equation,  we  may  use  the  normalization  a(««)  =  0.  In  the 
rheological  literature  the  relaxation  function  a  is  often  taken  to  be  a 
finite  linear  combination  of  decaying  exponentials  with  positive  coefficients 
obtained  by  a  least  square  fit  to  experimental  data. 

When  (1.1)  is  substituted  into  the  balance  of  linear  momentum,  the 
following  integrodifferential  equation  for  the  displacement  u  results 

^tt  "  ‘P^'^x^x  ^  ^  '  xSB,  t>0  .  (1-3) 

Here  *  denotes  the  convolution  ((X*6)(t)  =  a(  t-T  )6  (t ) dx ,  and  f  is  the 

sum  of  an  external  body  force  and  the  history  term  a'(t-T)^(u  (x,t))  dr - 
An  appropriate  dynamical  problem  is  to  determine  a  smooth  function 
u  :  B  X  (0,»)  R  which  satisfies  (1.3)  together  with  appropriate  boundary 
conditions  if  B  is  bounded,  and  which  at  t  -  0  satisfies  prescribed 
initial  conditions 


u(x,0)  =  Uq(x),  u^(x,0)  =  u-|(x),  X  e  B 

for  certain  smooth  functions  uq  and  u^#  To  avoid  technical  complications 
we  assume  in  the  following  that  f  -  0.  We  also  restrict  ourselves  to  the 
case  of  an  unbounded  body,  B  =  R  and  we  study  the  Cauchy  problem 

Utt  =  ^(^x^x  ^  '  X  e  R,  t  >  0  ,  (1.4) 

u(x,0)  =  Uq(x),  U|.(x,0)  =  u-j(x)  ,  X  6  R  .  (1.5) 

When  a’  E  0  and  <p  satisfies  (1.2),  the  body  is  purely  elastic.  In 

this  case  it  is  well  known  (see  Lax  [14],  MacCamy  and  Mizel  [15],  Klainerman 
and  Majda  [13]),  that  in  general  the  Cauchy  problem  (1.4),  (1.5)  does  not  have 
globally  defined  smooth  solutions,  no  matter  how  smooth  and  small  the  initial 
data  are  chosen.  The  initially  smooth  solution  u  develops  singularities 
(shock  waves)  in  finite  time- 

If  a*  ^  0  and  a  satisfies  the  sign  conditions  above,  the  fading  memory 
term  in  (1.4)  introduces  a  weak  dissipation  mechanism.  Significant  insight 
into  the  strength  of  this  mechanism  was  gained  by  the  work  of  Coleman  and 
Gurtin  [2] ,  who  studied  the  growth  and  decay  of  acceleration  waves  in 
materials  with  memory.  They  showed  that  the  amplitude  q(t)  of  an 
acceleration  wave  propagating  into  a  homogeneously  strained  medium  at  rest 
satisfies  a  Bernoulli-Riccati  ordinary  differential  equation.  The  coefficient 
of  q^  in  this  equation  is  proportional  to  a  second  order  elastic  modulus, 
which  is  given  by  <p**  in  our  model  problem,  and  there  is  a  linear  damping 
term  proportional  to  a'(0).  Thus  the  amplitude  q(t)  =  [u^t^  decays  to  zero 
as  t  provided  |q(0)|  is  sufficiently  small.  On  the  other  hand,  if 

<p"  ^  0,  then  q(t)  >  ®  in  finite  time  if  |q(0)|  is  large  enough,  and 
q(0)  is  of  a  certain  sign*  They  did  not  study  existence  of  such  solutions - 
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This  suggests  that,  under  appropriate  assumptions  on  <p,  i|i  and  a,  the 
Cauchy  problem  (1.4),  (1*5)  should  have  unique,  globally  defined  classical 
(C^)  solutions  for  sufficiently  smooth  and  small  initial  data  Uq,  u^,  while 
smooth  solutions  should  develop  singularities  in  finite  time  if  the  initial 
data  are  large  in  an  appropriate  sense.  Such  a  global  existence  result  for 
small  data  were  recently  established  by  Hrusa  and  Nohel  [10]  using  delicate 
a  priori  estimates  obtained  by  combining  an  energy  method  with  properties  of 
Volterra  equations  (even  in  the  presence  of  a  small  body  force).  We  refer  to 
a  recent  survey  [9]  for  earlier  small  data  results  on  initial  boundary  value 
problems  modelling  the  motion  of  finite  viscoelastic  bodies,  and  for  technical 
simplifications  of  the  analysis  in  the  special  cases  <p  =  or  a(t)  -  e 

For  the  global  results  the  Cauchy  problem  is  more  difficult  than  the  finite 
body  problem  because  the  PoincarS  inequality  is  not  available  to  estimate 
lower  order  derivatives  from  higher  order  derivatives - 

The  remainder  of  our  discussion  will  deal  with  the  formation  of 
singularities  in  finite  time  from  smooth  solutions  of  the  Cauchy  problem 
(1.4),  (1-5).  For  the  special  case  9  =  Markowich  and  Renardy  [17]  have 
obtained  numerical  evidence  for  the  formation  of  shock  fronts  in  finite  time 
from  large  data,  and  Hattori  [7]  has  shown  that,  if  cp"  ^  0  and  if  the  body 
B  is  finite,  then  there  are  smooth  initial  data  (which  he  does  not 
characterize)  for  which  the  corresponding  Dirichlet-initial  value  problem  does 
not  have  a  globally  defined  smooth  solution-  On  the  other  hand,  Hrusa  [8]  has 
shown  that  if  is  linear  and  only  is  allowed  to  be  nonlinear ,  then  the 

Cauchy  problem  (1.4),  (1-5)  does  have  globally  smooth  solutions,  even  for 
large  smooth  data-  Therefore,  we  shall  restrict  ourselves  to  the  case  when 
(p"  ^  0,  at  least  over  the  range  of  the  solution-  The  case  when  9”  changes 
sign  will  require  further  refinements . 

An  essential  ingredient  in  the  analysis  (which  is  also  important  for  the 
global  theory)  is  the  following  local  existence  result  which  is  established  by 
combining  Banach's  fixed  point  theorem  on  an  appropriate  function  space  with 
standard  energy  estimates  and  Sobolev’s  embedding  theorem. 


Proposition  1 : 

- -  Q  1  (*) 

Assume  that  9/  €  C'^(R)  satisfy  (1-2);  assume  a,  a',  a"  e  L^^^[0,«), 

and  there  is  a  constant  k  >  0  such  that 


9’(5)  >  X,  C  e  R  . 

Assume  that  vlq  6  and  that  u^,  6  h2{R).  Then  the  Cauchy  problem 

(1.4),  (1.5)  has  a  unicrue  classical  solution  u  6  C^(R  x  tO,TQ))  defined  on 
a  maximal  Interval  (0,To).  Tq  is  finite,  then 

sup  [|u  (x,t)|  +  !u^(x,t)|]  =“  . 

Bx[0,Tp) 


Here  the  square  bracket  means  integr ability  up  to  0.  No  sign  condition 
on  a  are  required,  but  a’(0)  finite  is  essential. 
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The  proof  of  Proposition  1  is  almost  identical  to  that  of  Theorem  2.1  of 
[6] ,  and  we  omit  the  details;  only  certain  readily  available  energy  estimates 
for  lower  order  derivatives  are  needed.  The  characterization  of  the  maximal 
interval  of  existence  is  established  by  combining  the  energy  estimates 
obtained  in  [6]  with  a  Gronwall  inequality  argument.  We  remark  that  the 
energy  estimates  used  in  the  proof  of  Proposition  1  yield  time-dependent 
bounds  which  cannot  be  used  to  obtain  global  estimates.  These  can  only  be 
constructed  by  taking  advantage  of  the  damping  mechanism  induced  by  the  memory 
term  under  appropriate  sign  conditions  on  a  and  by  assuming  the  initial  data 
to  be  small  (see  [10]  for  details). 

The  assumptions  concerning  the  kernel  a  in  Proposition  1  imply  that 
a'  is  absolutely  continuous  on  [O,*®).  Recently,  Hrusa  and  Renardy  [11] 
established  a  result  similar  to  Proposition  1  (and  proved  a  global  existence 
result  for  small  data  for  bounded  bodies)  under  assumptions  which  permit  a 

singularity  in  a*  at  t  -  0  (e.g.  a' (t)  -t^~\  0<a<1  as  t-*^  0^). 

Such  singularities  are  relevant  for  certain  popular  models  of  viscoelastic 
materials. 

Our  main  result  on  development  of  singularities  for  large  enough  data  is 

Theorem  1; 

Let  cp,  e  C^(R)  satisfy  (1.2)  and  assume  a,  a',  a”  6 

Assume  that  <p”(0)  ^  0.  Then,  for  every  T*|  >  0,  we  can  choose  initial 

data  u^,  u-|  6  C  (R)  n  L  (R)  such  that  the  maximal  time  interval  of 
existence,  given  by  Proposition  1,  for  the  smooth  solution  of  the  Cauchy 
problem  (1.4),  (1.5)  cannot  exceed  T^"!  More  precisely,  if  sup  |ul  (x)  | 

xeR 

and  sup  |u-(x)|  are  sufficiently  small,  while  un(x)  and  u^(x)  assume 

xeR 

sufficiently  large  values  with  appropriate  signs,  then  there  is  some 
t*  <  T-|  such  that 


sup  *  {|u^j^(x,t)|  +  |u^(x,t)|}  ,  (1.6) 

RX[0,t  ) 


while 


sup  *  {|u  (x,t)|  +  |u.(x,t)|}  <  *  (1.7) 

Rx[0,t  ) 

(and  in  fact,  this  latter  quantity  remains  small). 

In  view  of  the  analogy  with  hyperbolic  conservation  laws  and  the 
numerical  evidence  [17],  it  is  to  be  expected  that  a  blow-up  as  established  by 
Theorem  1  will  lead  to  the  development  of  a  shock  front. 

The  method  of  the  proof,  sketched  in  section  2  is  to  show  that  the  memory 
term  is  in  fact  of  lower  order  than  the  elastic  term  ^(^x^x 

treated  as  a  perturbation.  While  considerably  more  technical,  the  proof  is  a 
generalization  of  the  approach  of  Lax  [14]  for  showing  the  development  of 
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singularities  for  the  quasilinear  wave  equation 

=  (p(u  ) 
tt  XX 

Theorem  1  was  established  independently  by  Dafermos  [4]  using  an  approach 
which  is  different  from  ours  but  similar  in  spirit.  The  result  can  also  be 
established  by  modifying  the  results  of  F.  John  [12]  and  extending  them  to 
systems  of  quasilinear  hyperbolic  conservation  laws  which  contain  lower  order 
source  terms  (F*  John,  private  communications). 

Similar  results  for  first  order  model  problems  were  derived  by  Malek* 
Madani  and  Nohel  [16]  and,  using  different  methods,  by  Renardy  [18]  and 
Dafermos  [3] . 


A  particular  case  of  the  model  equation  studied  in  this  paper  leads  to  a 
model  for  shearing  flows  of  viscoelastic  fluids  studied  recently  by  Slemrod 
[20] .  With  v(x,t)  denoting  the  velocity  of  the  fluid  in  simple  shear, 
Slemrod  studies  the  problem 


f  (x  e  R,  t  >  0) 
v(x,0)  -  Vp(x)  ,  (x  e  R) 


(1.8) 


for  the  special  case  a  =  e"*^.  Problem  (1.8)  leads  to  a  Cauchy  problem  of  the 
form  (1.4),  (1.5)  after  differentiation  with  respect  to  time.  Then  Theorem  1 
can  be  used  to  get  a  blow-up  result  for  thl  s  problem,  like  the  result  found  by 
Slemrod  for  a(t)  =  The  global  existence  of  solutions  for  small  data 

follows  from  [5,  Theorem  4,1).  Other  popular  models  for  viscoelastic  fluids 
have  been  analyzed  by  the  method  used  in  this  paper;  the  results  will  be 
published  elsewhere- 


2.  Development  of  Shocks-  In  this  section,  we  sketch  the  proof  of 
Theorem  1  establishing  the  development  of  shocks  from  initially  smooth 
solutions  of  the  Cauchy  problem  (1.4),  (1-5)  in  finite  time.  For  simplicity, 
most  of  the  analysis  will  be  carried  out  for  the  special  case  a(t)  = 
the  proof  for  more  general  relaxation  functions  as  well  as  for  a  more  general 
class  of  model  equations  will  be  carried  out  in  a  forthcoming  paper. 

We  begin  by  transforming  (1.4)  to  an  equivalent  system,  we  let 
w  -  V  =  u^,  and  write  the  constitutive  assumption  (1-1)  in  the  form 

a  -  (p(w)  -  z  ,  z  =  -a**4^{w) 

Since  we  have  assiomed  9*  >  0,  the  first  of  these  equations  can  be 
solved  for  w, 

w  “  9  ^ (a+z)  =;  g(a,z)  , 

and  g  is  a  smooth  function  of  a  e  R,  z  6  R-  As  long  as  the  solution 
remains  smooth,  the  Cauchy  problem  (1.4),  (1.5)  is  equivalent  to  the  first 
order  system 
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t 


V 


t 


a 

X 


=  c^{a,z)v^  +  a*  (0)i|;(g(a,z)  )  +  a”*\|;(g(a,z)  )  , 

=  -a*  (0)T|;(g{a,z))  -  a”*iKg((y,z) )  . 

The  initial  conditions  become 


v(x^O)  =  u^(x),  cy(x,0)  =  (p(u^(x)),  z(x,0)  ==  0  •  (2,2) 

By  c  we  have  denoted  the  wave  speed 

Vo 

c(cr/z)  [<p*  (g(a,z)  ; 

c  is  a  smooth  function  of  a  and  z.  The  system  (2.1)  is  hyperbolic,  and 
its  eigenvalues  are  -i-c,  -c  and  0.  Under  the  assumptions  of  Proposition  1, 
a  C^-solution  exists  on  some  maximal  interval  Rx  [0,Tq).  if  Tq  is  finite, 
then  v,  a,  z  or  one  of  their  first  derivatives  must  become  infinite  as 
t  ->■  Tq.  It  is  immediate  from  equation  (2,1)  that  and  will 

remain  bounded  as  long  as  v,  0,  z,  v^  and  v^^  are  bounded. 

To  proceed  further,  we  extend  the  classical  approach  of  Lax  [14]  for 
first  order  hyperbolic  2  x  2-systems.  We  define  "approximate**  Riemann  in¬ 
variants  by  those  quantities  which  would  be  the  classical  Riemann  invariants 
if  z  in  the  first  two  equations  of  (2-1)  were  treated  as  a  parameter-  These 
quantities  are  given  by 

r  =  r(v,a,z)  =  v  +  $(a,z)  , 


s  =  s(v,a,z)  -  V  -  $(a,z)  , 


(2.3) 


$(a,z)  ^  / 


ac 


0Q  c(C#z)  ' 


without  loss  of  generality  we  may  take  =  0*  Since  $^(0,z)  = 

1 


c(a,z) 


0  -  -  -O' 

>  0,  this  correspondence  is  smoothly  invertible,  and  we  have 


r+s  *  /  X  r-s 


In  the  following,  we  assume  a(t)  «  e"”^.  Then  (2.1)  takes  the  simple  form 


0  =  c^(0,z)v  -  t|;(g(0,z))  +  z  ,  (2-4) 

t  X 

=  'P(g(Of2;)  )  -  z 


We  now  differentiate  r  and  s  along  the  c  and  -c  characteristics, 
respectively,  and  z  along  the  zero  characteristic  (i.e.  we  form 
rt  -  cr^j,  s^  +  cs^  and  z^).  This  leads  to  the  following  first  order 
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hyperbolic  system  equivalent  to  (2.4),  (2-2) 

-  Ar  =  -Bz  +  CD  , 

t  X  X 


+  As  =  “Bz  -  CD  , 
t  X  X 


(2.5) 


with  the  initial  data 

r(x,0)  =  u^{x)  +  #(q>(u^(x)),0)  , 

s(x,0)  =  u^(x)  -  f  (<p(Uq(x)  )  ,0)  ,  (2.6) 

z(x,0)  =  0  f 


A 

B 

C 

D 


A(ir.SrZ) 

B(r,s, z) 


C(r,s, z) 


D(r,s,z) 


:=  c(a(r,s,z) ,z)  >  0  , 


;=  c(a(r,s,z) ,z)$^(a(r,s,z)  ,z)  , 


:=  S/2)  ,z)  )  -  z 


(2-7) 


To  establish  the  development  of  shocks  in  finite  time,  we  study  the 
evolution  along  characteristics  of  the  quantities 


p  V  +  — - - 

X  c(cj,z) 


(2,8) 


T  V - ; - ^  , 

X  c(a,z) 

and  z„*  Note  that  if  z  were  a  constant  parameter,  then  p  and  t  would 

^  1  1 
be  the  x-derivatives  of  r  and  s.  We  have  ~  "2  ^x  ~  ^  c(p-T),  and 


2cc 

CT 


(P"(q(a,z)) 

9'(g(0,z))’ 


A  tedious  but  straightforward  calculation  using  the  relations  (obtained  by 
di f f erentiat ing  (2.4)) 
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XX 

2 


+  (c^)^(a,z)a 


+  [c  )  {a,z)z  V  -  D 

^  XX  X 


V 

X  X 


2tx  “  ' 

yields  the  system 

(c^)^ 

^t  ■  " ~~r~  IpII\I  +  Ul  Iz^l 

+|p|+|t|+|Zx|)  / 

/  2, 

(c  ) 

^t  - - 4~  +  0(|p|  Iz^l  +  |t|  Iz^l 

+|p|+|t|+|zx|)  , 

Zxt  °<IpI  +  1^1  +  l^xl^  • 


subject  to  the  initial  data 

4| 

p(x,0)  =  u^(x)  +  (p' (u^(x)  )'^2  ujjjx)  , 
T(x,0)  =  ul|(x)  -  (p' (u^(x)  )^2  ujifjj)  ^ 

Zx(x,0)  =  0  . 


The  cross  product  terms  pT  in  (2.10)  are  eliminated  if  one  considers 

1  1 

characteristic  derivatives  of  c(cr,z)'^2p  c(a,z)'^2f  (see  Lax  [14] 

Slemrod  [19]).  we  find 

^It  ‘  "  YCc'^^p)^  +  0(  IpI  |z^|  +  1t|  |z^| 

+  1p|  +  |t|  +  |z^|)  , 

‘It  “  Y(c'^2i.)2  +  o{|p1  |z^|  +  |t|  |z^| 

+  |p1  +  |t|  +  |z^|)  , 

^xt  "  °‘|P|  l^xl^  * 


(2.9) 


(2. 10) 


(2.11) 


the 

and 


(2.  12) 
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Here  the  coefficient  function  y  is  given  by 


y  =  y{o,z) 


1  cp“(g(g/z) ) 

(p’  (g(a,2) ) 


For  definiteness,  let  us  assume  <p"(0)  >  0  (the  discussion  for  (p"(0)  <  0  is 
analogous).  We  take  initial  data  with  the  following  properties:  u^  and 
u-j  (and  hence  p(x,0),  t(x,0)  as  well  as  z(x,0)  =  0)  are  uniformly  small, 
and  p(x,0),  T(x,0)  are  such  that  at  least  one  of  them  has  a  large  positive 
maximum  (by  choosing  ug  or  or  both  sufficiently  large) .  At  the  same 

time,  the  maxima  of  -p  and  *t  should  not  be  too  large. 


As  long  as  (r,s,z)  remains  within  a  given  neighborhood  U  of  0,  we 
have  upper  and  lower  bounds  for  the  coefficients  occuring  in  (2.12),  in 
particular,  we  have  a  positive  lower  bound  Yq  Y*  We  shall  see  later 

that  (r,s,z)  will  remain  in  U  up  to  the  time  of  blow-up  if  they  are  small 
enough  initially  and  if  we  make  the  maximum  of  p(x,0)  or  t{x,0)  large 
enough . 

For  every  t  >  0,  we  now  set 

h(t)  =  max[max  p(x,t),  max  T(x,t)] 

X  X 

From  (2.12),  we  find  that,  as  long  as  (r,s,z)  €  U,  while  h(t)  is  large 
and  maxjz  |  <<  h(t),  we  have,  for  some  positive  constants  Yq  and  k 

X 

(4r)^  h(t)  >  Y„(h(t))^,  and  max|z  <  Kh(t)  «  (h(t))^  . 

at  +  U  xt 


Initially,  we  have  jz^^l  =  0  and  it  follows  from  these  inequalities  that  it 
will  remain  small  compared  to  h(t).  We  also  find  that  h(t)  becomes 
infinite  in  finite  time.  Since  there  is  also  some  constant  y^  such  that 
d  2  * 

(■^)  h{t)  <  Y-,(h(t))  /  it  can  be  shown  that,  with  t  denoting  the  blow-up 

dt  +  1  ^ 

^2 

time  of  h,  we  have  — —  <  h(t)  <  ~ —  for  some  constants  and  C2- 

t  -t  t  -t 

The  third  equation  of  (2.12)  then  implies  that  |z^|  grows  at  most 

logarithmically  as  t  t*.  Since  log(t*-t)  is  integrable,  equations  (2.5) 
imply  that  r,  s,  and  z  remain  bounded  and  in  fact  small  if  their  initial 
data  are  small,  and  t*  is  small  (which  is  the  case  if  h(0)  is  large),  in 

this  way,  we  can  choose  the  data  such  that  (r,s,z)  will  in  fact  remain  in 

U  up  to  the  time  of  blow-up.  This  completes  the  sketch  of  the  proof. 
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A  FAST  ALGORITHM  FOR  NON-NEWTONIAN  FLOW 


David  S.  Malkus 
Mathematics  Research  Center 
University  of  Wisconsin  -  Madison 
Madison.  W1  53705 

ABSTRACT.  The  goals  of  the  project  described  here  are  twofold:  First,  to  turn  an  existing 
pilot  algorithm  for  the  steady  flow  of  non-Newtonian  memory  fluids  into  a  robust  and 
efficient  algorithm.  Second,  render  enhancements  of  the  method's  current  capabilities 
computationally  feasible.  Such  enhancements  include  fully  coupled  thermal  dependence, 
material  compressiblity,  and  free  surface  flows.  The  pilot  algorithm  is  a  finite  element 
method  whose  novelty  lies  in  its  computation  of  the  stress  field  in  a  nonlinear  iteration 
scheme.  The  stress  at  a  point  is  a  non-local  functional  of  the  current  velocity  iterate, 
and  the  pilot  method  has  demonstrated  the  feasiblity  of  reliable  computation  with  such 
constitutive  ecjuations.  Before  the  method  can  take  its  place  as  a  reliable  scientific  and 
engineering  tool,  intensive  effort  must  be  made  to  reduce  the  computational  cost  in  the 
manner  decribed  here. 


I.  VISCOELASTIC  FLUIDS.  The  following  equations  are  solved  numerically,  using  the 
finite  element  method  1  —  4':  The  equations  of  steady  motion. 

V  •  a- f  = /)(u  •  V)u  (1) 

where  u  is  the  velocity  field,  a  the  stress  tensor,  f  a  body  force,  and  p  the  density.  The 
equation  of  continuity  for  an  incompressible  fluid  is 


V  ■  u  =  0  (2) 

For  non-Newtonian  fluids,  the  crucial  equation  is  the  constitutive  equation, 

a  ~  -pi  -H  2p(0)i?e  —  (l  -  R)a'  (3) 

where  p  is  an  isotropic  contribution  to  the  stress,  p(0)  is  a  zero-shear  viscosity,  R  is  a  ratio 
of  a  retardation  time.  A.  to  a  retardation  time.  T.  and  a'  is  an  extra  stress  tensor.  The 
ratio,  R,  and  its  complement  determine  the  proportion  of  the  stress  which  is  New'tonian 
—  and  usually  is  the  result  of  a  Newtonian  solvent  —  and  the  complementary  proportion 
from  the  extra  stress  —  usually  due  to  long-chain  molecules  (such  as  polymers)  disolved 
in  the  Newtonian  solvent. 


There  are  many  proposed  forms  for  the  extra-stress  tensor;  there  are  two  basic  cat¬ 
egories:  the  differential  and  integral  models  1  .  Here  we  shall  only  be  concerned  with  the 
integral  form. 


m/(r) 


k-  ] 


(t) 

lPk_ 

f 


(4) 
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where  /in  is  a  constant  determining  /i(0),  and  is  a  strain  measure,  measuring  tlie 

deformation  which  carried  the  particle  from  its  position  at  time  r  in  the  past  to  the  stress 
evaluation  point  at  the  present  time,  0.  The  strain  measures  are  the  same  kind  employed 
in  finite  elasticity.  The  memory  functions,  m/,  are  usually  sums  of  e.xponentials,  each  with 
amplitude  determiiied  by  the  modulus,  and  decay  constant,  which  determines 
the  fraction  of  the  basic  decay  rate,  T.  Thus  a  computational  method  must  deterniiiie 
the  deformation  history  of  every’  stress  evaluation  point  required  to  solve  the  equations  of 
motion  in  some  approximate  way.  compute  the  required  strain  measure  —  which  is  almost 
always  highly  nonlinear  in  its  dependence  on  the  velocity  field  —  and  then  approximate 
the  history  integral  over  an  infinite  interval.  This  just  computes  the  stress,  and  then  the 
stress  computation  must  be  imbedded  in  some  iterative  scheme  to  produce  an  approximate 
solution  to  the  highly  nonlinear  equations  of  motion. 

II.  SOME  FLOWS.  In  this  section  we  give  a  brief  description  of  some  of  the  flows  to  which 
the  current  method  is  being  applied.  The  geometry  of  these  flows  is  quite  simple  and  the 
results  obtained  do  not  illustrate  the  real  power  of  the  finite  element  method.  It  is  hoped 
that  the  reader  will  appreciate  that  the  method  described  here  is  siill  very  much  in  the 
development  stage,  and  that  tlie  problems  so  far  investigated  by  the  author  and  other 
researchers  are  intended  to  isolate  the  complexity  Inherent  in  the  non-Newtonian  nature 
of  the  flow  from  other  possible  cotnplicalions.  Nevertheless,  there  seen\s  to  be  a  good  deal 
of  physical  interest  in  the  problems  pictured  here,  in  spite  of  their  geometric  simplicity. 


Flow  over  a  transverse  slot  computed  by  the  pilot  method  on  a  rnosh  of  1008  clemcriis. 
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The  first  flow  is  a  plane  flow  over  a  transverse  slot.  The  slroaTnlines  plotted  in  J'igiire 
1  are  taken  from  a  solution  computed  by  the  author,  using  a  constitutive  equation  of  his 
own  devising  [Ij  and  the  rnesh  of  1008  crosed-triangle  macroelements  illustrated  in  ref.  1. 
The  flow  is  at  a  “Deborah  number"  of  4.7  (this  can  be  thought  of  as  a  dimensionless  shear 
rate).  Flow  is  from  right  to  left,  and  undisturbed  flow  profiles  have  been  imposed  at  the 
inflow  and  outflow.  Actually,  since  there  is  fluid  memory,  the  inlet  condition  is  that  the 
flow  continues  forever  upstream  as  undisturbed  plane  Poiseuille  flow.  Figure  1  illustrates  a 
chracteristic  tilt  to  the  vortex  in  the  slot,  which  is  opposite  in  direction  to  the  tilt  observed 
in  Newtonian  flows  with  non-zero  Reynolds  number  |3]. 

The  interest  in  flows  over  transverse  slots  arises  from  the  fact  that  there  seems  to  be 
an  important,  relation  between  the  difference  between  the  pressures  at  top  and  bottom  of 
the  slot  and  the  first  normal-stress  difference  of  the  fluid  in  the  undisturbed  flow  [1  -  3J. 
There  seems  to  be  a  di.screpancy  between  what  the  numerical  models  predict  and  laboratory 
experiments  measure  in  such  flows,  and  it  is  one  of  the  author’s  liighest  priorities  to  resolve 
this.  The  results  could  have  important  ramifications  for  devices  designed  to  measure  the 
first  normal-stress  difference  using  “hole-pressure”  measurements. 


FlCaiRE  2 

Abrupt  contraction  flow  computed  by  the  pilot  method  with  940  elements. 

The  next  flow  is  that  of  flow  through  an  abrupt,  planar  contraction.  Figure  2  pictures 
such  a  flow;  flow  is  from  left  to  right  so  that  the  fluid  is  being  forced  form  the  larger  to 
the  smaller  channel.  Because  symmetry  is  assumed,  the  computational  domian  is  only  the 
top  half  of  a  channel  cro.ss-section.  The  flow  pictured  here  uses  the  same  fluid  model  as 
that  of  Figure  I,  at  a  slightly  lower  Deborah  number.  3.7.  Inflow  and  outflow  conditions 
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arc  imposed  as  before;  extreme  care  is  taken  to  match  the  flux  al  inflow  and  outflow 
boundaries. 

The  interest  in  this  flow  stems  from  the  fact  that  some  fluids  seem  to  behave  quite 
differently  than  others  in  contraction  flow.  Some  fluids,  such  as  polystyrene  or  high-den.sity 
polyethelyne  melts,  seem  to  have  relatively  smaller  “dead-spaces”  or  recirculation  regions 
at  high  Deborah  number  than  at  low  Deborah  number,  while  some  f)ranched  polymers, 
such  as  low-density  polyethelyne,  seem  to  do  quite  the  opposite,  developing  recirculation 
regions  emanating  from  entry  which  dominate  the  whole  flow-field.  The  flow  pictured  in 
Figure  2  has  about  the  same  size  recirculation  region  as  a  flow  of  the  same  fluid  at  low 
shear  rate.  The  author  is  interested  in  further  study  of  this  flow  in  order  to  find  out  what 
property  of  the  constitutive  model  is  associated  with  entry  vortex  behavior.  The  ability 
to  predict  recirculation  size  is  of  practical  import  because  fluid  trapped  inside  dead-spaces 
tends  to  degrade.  It  would  be  useful  to  able  to  determine  how  much  polymer  degradation 
could  be  expected  in  a  given  die  as  a  function  of  measurable  material  properties. 


FIGURE  ,3 

Newtonian  flow  in  a  plane  cross-section  of  a  journal-boaring. 

The  final  flow  is  one  for  which  the  author  has  as  yet  no  results;  that  is  the  flow  in 
a  journal-bearing.  Figure  3  pictures  the  cross-section  of  two  eccentrically  placed  cylinders 
with  fluid  between  them  to  lubricate  and  prevent  solid-to-solid  contact.  There  are  two 
important  aspects  to  this  flow  which  differ  from  the  previous  two  flows:  First,  there  are 
no  inflows  and  outflows,  and  second,  there  are  no  domain  corners  to  generate  singularities 
in  the  stress-field. 
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The  author's  interest  in  this  problem  is.  first,  just  that  it  is  quite  different  from  the 
other  two.  It  will  be  interesting  to  observe  the  behavior  of  the  numerical  method  here 
because  it  omits  two  puzzling  aspects  of  memory-fluid  problems:  history-dependence  at 
inlets,  and  stress-singularities  of  unknown  character.  There  is  also  physical  interest  in  this 
problem  because  the  effects  of  fluid  elasticity  on  the  load-bearing  capacity  of  the  bearing 
may  be  beneficial.  It  would  be  able  to  predict  load  bearing-capacity  from  measurable 
material  properties,  and  numerical  modelling  may  help  to  do  so. 

III.  COMPUTATIONAL  METHOD.  The  strain  measure  in  the  integrand  of  eq.  (1)  is 
assumed  to  be  determined  by  a  deformation  gradient.  Eo(r),  just  as  in  nonlinear  elasticity. 
Only  in  the  present  case,  the  deformation  gradient  is  assumed  to  be  computable  from  a 
system  of  linear,  non-constant  coefficient  ordinary  differential  equations  along  the  path 
followed  by  each  particle  at  which  the  stress  is  to  be  evaluated.  The  usual  deformation 
gradients  of  large-strain  elasticity  can  also  be  obtained  from  special  cases  of  the  following 
evolution  equations: 

x(r)  =  v:x(r), 
x(0)  =  Xo 

•  (5) 

Eo(r)  =F(x(r),Vv|x(r)])Eo{r) 

Eo(0)  -  I 

The  first  two  sets  of  equations  determine  the  pathline  (streamline)  followed  by  a  particle 
to  bring  it  from  its  position,  x,  at  time  t  in  the  past  to  its  present  position  at  the  stress 
evaluation  point.  Xq.  To  evaluate  the  integrand  of  eq.  (4),  these  equations  are  solved  as 
an  initial  value  problem  in  reverse  time.  This  determines  the  non-constant  coefficient  in 
the  evolution  equation  for  the  gradient,  which  is  assumed  to  be  a  traceless  matrix,  F.  The 
common  deformation  gradient,  is  obtained  when  F  is  Vv  itself. 

The  fundamental  strategy  of  the  current  numerical  method  is  to  choose  constant 
strain-rate  finite  elements:  then  the  evolution  equation  is  a  constant-coefficient  equation 
on  each  element.  This  strategy  is  enabled  by  a  basic  property  of  linear  ODEs:  If  we  define 
a  deformation  gradient,  E^j,  relative  to  time  7]  by 

E.,  -  FE,, 

E.,(ri)  =  I 

then  the  strain  relative  to  the  present  time,  evaluated  at  any  earlier  time  is  give  by  matrix 
multiplication: 

Eo(0  -  Eo(ri)E,,(r)  (7) 

This  provides  interface  conditions  between  finite  elements,  so  that  only  constant-coefficient 
equations  need  be  solved  on  each  element:  it  Turns  out  that  such  solutions  are  known 
analytically,  a.s  is  the  pathline  and  transit  time  along  it  1  —  2  . 

Thus,  given  an  estimate  of  the  solution  to  the  problem  it  terms  of  a  velocity  field, 
the  integrand  of  eq.  (4)  can  straight-forwardly  be  computed  at  each  historical  time.  In  the 


(6) 
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current  method,  this  is  used  in  conjunction  with  a  specially  devised  Gaussian  quadrature 
formulas  to  approximate  a': 

A’, 

So(^)w(r)  dr  (8) 

A'=] 

With  what  we  have  thus  far.  the  stress  can  be  computed  in  any  trial  velocity  field;  to 
approximate  the  solution  to  eqs.  (1)  and  (2),  the  usual  Galerkin  procedure  can  be  followed 
in  which  the  residual  of  eq.  (1)  is  dotted  into  a  test  function,  v^,  drawn  from  the  same 
space  as  the  trial  solutions,  and  the  result  is  integrated  over  the  problem  domain.  After 
integration  by  parts  and  replacement  of  the  spatial  integral  by  a  numerical  integral  with 
points  and  weights  6^.  we  get  something  which  looks  like 

Y2  -  2s(V  ■  u'’)(V  ■  v'^)  +  p(u  •  V)u'’  .  v'‘  -  •  f)(Ce)  =  0  (9) 

£ 

The  pressure  term  of  eq.  (3)  has  been  replaced  by  a  penally  term  ;3^  with  penalty  parameter 
z:  thus  there  are  no  explicit  pressure  unknowns,  and  the  continuity  equation  (2)  is  satisfied 
to  0[z~^),  Eq.  (9)  illustrates  the  =  0  case;  for  nonzero  R,  the  obvious  modification  of 
adding  a  Newtonian  viscous  term  is  made. 

The  important  point  to  observe  about  eq.  (9)  is  that  to  evaluate  its  residual,  it  is 
required  to  evaluate  the  stress  at  the  points  fe  by  means  already  discussed.  To  complete  the 
method,  what  is  needed  is  a  means  of  correcting  estimates  of  the  discrete  solution,  based  on 
evaluation  of  the  residual;  New'ton’s  method  might  an  example  of  such  a  procedure,  but,  as 
we  shall  see,  this  is  not  entirely  straight-forward.  The  current  algorithm  employs  the  inverse 
Broyden  method  :1,2|  to  solve  the  discrete  nonlinear  equations.  An  important  point  to  be 
made  here  is  that,  regardless  of  the  choice  of  iterative  scheme,  the  method  outlined  here  is 
enormously  costly  in  practice,  because  for  a  reasonably  fine  mesh,  each  evaluation  of  the 
stress-field  values  at  the  spatial  integration  points  is  a  potentially  formidable  computation. 

IV.  FAST  ALGORITHMS.  The  method  outlined  in  the  previous  section  applies  to  isother¬ 
mal,  incompressible  flows  in  a  fixed  spatial  domain.  These  restrictions  are  not  essential; 
material  compressibility  and  temperature  dependence  can  be  handled  in  very  similar  fash¬ 
ion  if  “artificial  (historical)  time”  is  introduced,  in  which  either  density  or  temperature 
are  used  to  change  the  time  variable  along  the  pathlines  in  such  a  way  that  a  traceless 
matrix  in  the  evolution  equation  is  obtained  [5i.  The  transformation  to  artificial  time  does 
not  in  itself  seem  to  be  computationally  costly,  but  these  problems  involve  added  levels  of 
complexity  to  an  already  complicated  solution  procedure  with  additional  fields  and  corre¬ 
sponding  equations.  The  resulting  phenomena  are  likely  to  be  more  intricate  in  detail  and 
more  nonlinear  in  character.  A  similar  observation  can  be  made  about  free-surface  flow's; 
a  well-developed  methodology  exists  ib;  to  solve  such  problems,  w'hich  can  be  directly  in¬ 
terfaced  W'ith  the  method  outlined  here,  but  this  also  certain  to  render  the  computations 
more  formidable  than  they  now  are.  With  the  current  algorithm,  computations  on  a  mesh 
which  is  refined  only  to  the  extent  which  seems  to  be  required  to  obtain  acceptable  accu¬ 
racy,  at  a  shear  rate  normally  occuring  in  polymer  processing,  the  computation  of  a  steady 
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solution  can  take  as  long  a  40  minutes  on  a  Cray  1-A.  This  must  be  reduced  drastically  if 
the  algorithm  is  to  used  routinely  in  scientific  and  engineering  research,  particularly  if  the 
more  physically  realistic  enhancements  mentioned  above  are  to  be  added.  The  remainder 
of  this  paper  will  discuss  several  approaclies  to  the  reduction  in  computational  cost  which 
are  currently  being  implemented  or  investigated  by  the  author. 

Vectorization  of  Linear  Equation  Solving.  A  variety  of  new  computers  have  the  capability 
of  carrying  out  hardware  vector  operations;  rearranging  the  computer  code  in  such  a  w’ay 
that  the  compiler  can  take  advantage  of  this  capability  can  result  in  substantial  savings  in 
computational  cost.  One  part  of  a  typical  code  where  such  savings  have  a  good  chance  of 
being  realized  is  in  the  solution  of  linear  equations.  Unfortunately,  in  the  current  algorithm, 
it  is  not  expected  that  this  can  dramatically  reduce  the  run  time.  Linear  equations  are 
solved  in  the  nonlinear  iteration  scheme,  but  this  appears  to  account  for  a  small  portion  of 
the  computational  cost.  The  major  portion  of  the  calculation  is  carried  out  at  the  element 
level  with  small  arrays  or  scalar  quantities  involved  in  resolving  element  boundary  crossings 
and  accumulating  the  deformation  gradient  by  small  matrix  multiplication.  Vectorization 
offers  little  hope  of  speeding  up  these  calculations.  On  the  the  other  hand,  it  is  expected 
that  linear  equation  solving  will  begin  to  play  a  more  and  more  important  role  with  the 
planned  enhancements  to  the  code  discussed  earlier.  The  Jacobian  terms  corresponding  to 
the  thermal  energy  equation  are  easy  to  form:  likewise  the  part  of  the  Jacobian  associated 
with  inertial  terms  and  the  unknown  free*surface  transformation  are  easy  to  deduce.  Also, 
active  research  is  under  way  aimed  at  producing  the  Jacobian  terms  associated  with  the 
non-Newtonian  viscous  terms  (see  below). 

In  short,  the  future  development  of  the  code  seems  to  point  in  the  direction  of  an 
algorithm  w’hich  has  a  large,  unsymmetric,  and  possibly  not  banded  matrix  to  factor  at 
each  one  of  dozens  of  possible  iterates.  The  current  iteration  method  has  only  one,  banded, 
symmetric,  positive-definite  matrix  to  factor  at  the  outset,  and  a  back-substitution  at  each 
iteration  (the  unsymmetric  Jacobian  contribution  of  the  inertial  terms  is  left  to  the  inverse 
updating  scheme).  It  therefore  seems  appropriate  to  modify  the  code  at  the  present  time 
to  take  full  advantage  of  vectorization,  in  order  to  make  sure  the  lineitr  equation  solving 
phase  remains  in  the  baeground.  as  it  should. 

.Adaptive  Memory  Quadrature.  The  area  which  seems  to  show  most  promise  in  reduction  of 
the  the  computational  cost  is  that  of  the  stress  calculation  at  an  individual  stress  evaluation 
point.  There  seem  to  be  several  possible  approaches,  the  underlying  strategy  of  all  of  them 
is  to  take  advantage  of  the  fact  that  the  stresses  are  being  evaluated  in  what  is  hoped  will 
be  a  convergent  sequence  of  velocity  iterates.  Particuarly  further  along  in  the  sequence, 
the  previous  iterate  should  be  able  to  provide  a  guide  to  estimate  how  much  computation 
is  absolutely  necessary  at  the  next  iteration. 

Perhaps  the  most  obvious  way  to  do  this  is  to  use  the  previous  iterate  to  determine 
what  Np  of  eq.  (8)  should  be  in  the  next  iterate.  It  is  observed  that  in  some  flows,  very 
many  fewer  quadrature  points  are  needed  to  accurately  compute  a'  than  in  other  flow’s. 
The  strategy  will  be  to  begin  the  iterations  with  a  nominal  number  of  points  for  each  stress 
evaluation  point  and  increase  or  decrease  that  number  in  succeeding  iterations,  based  on 
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adaptive  criterea  determined  from  previous  iterations. 


Jacobian  Approximation.  The  reason  that  the  present  algorithm  employs  an  updating 
scheme  rather  than  a  direct  calculation  of  the  Jacobian  is  that  it  is  not  a  trivial  mat¬ 
ter  to  construct  the  Jacobian,  or  even  write  down  a  closed  form  expression  for  it.  It  is 
clear  that  the  stress  at  a  point  can  depend  on  velocities  far  from  that  point,  and  therefore 
the  Jacobian  of  the  residual  of  eq.  (t>)  cannot  have  the  usual  finite  element  band  and/or 
sparsity  structure.  In  ref.  7,  an  approximation  scheme  for  the  Jacobian  is  proposed.  It  is 
not  clear  at  present  whether  this  approximation,  some  other,  or  even  an  exact  computation 
of  the  Jacobian  is  best  (the  latter  may  be  possible  to  undertake  —  it  is  not  clear  at  this 
time).  But  the  work  of  ref.  7  shows  clearly  the  complexity  involved.  The  terms  of  the 
Jacobian  contribution  from  the  extra  stress  are  computed  by  tracking  along  streamlines. 
The  resulting  Jacobian  element  matrices  are  not  square:  Their  column  dimension  depends 
on  the  number  of  different  elements  the  particle  path  passes  through  before  the  final  in¬ 
tegration  point  of  eq.  (8)  is  located.  It  appears  that  a  frontal  solution  technique  is  called 
for  in  order  to  handle  the  resulting  global  matrix  7:. 

Pseudo-Dynamic  Relaxation.  The  fiope  in  computing  the  Jacobian  is  that  the  computa¬ 
tional  cost  will  be  more  than  returned  in  improved  convergence  rate  over  the  inverse  Broy- 
den  algorithm  afforded  by  Newton's  method  or  modified  Newton's  method  with  Broyden 
updates.  But  the  complexity  of  the  Jacobian  calculation  is  such  that  this  may  never  be  real¬ 
ized.  and  it  is  well  worth  the  investigation  of  other  improvements  to  the  iterative  solution  of 
the  nonlinear  equations.  One  avenue  currently  being  explored  is  that  of  “pseudo-dynamic 
relaxtion."  The  problem  is  cast  as  a  time-dependent  problem  and  steady  solutions  are 
obtained  by  letting  the  transient  phenomena  die  out.  In  the  algorithm  presented  here, 
the  transient  behavior  does  not  represent  the  true  d5mamics  of  the  non-Newtonian  fluid: 
the  stress  is  computed  in  the  current  velocity  field  as  if  it  had  been  a  steady  field  for 
all  time,  hence  the  name  “pseudo-dynamic.”  To  do  otherwise  would  involve  complexities 
beyond  what  seems  manageable  at  present,  though  implementation  of  the  pseudo-dynamic 
algorithm  does  open  the  door  for  future  exploration  of  true  dynamic  behavior. 

One  may  easily  verify  that  the  steady-states  of  the  pseudo-dynamic  algorithm  are 
the  same  as  the  steady  states  of  the  true  dynamic  algorithm.  The  reason  for  taking  the 
pseudo-dynamic  approach  is  to  produce  a  different  kind  of  steady-state  iteration  scheme, 
in  which  the  damping  of  the  high  frequency  modes  in  the  pseudo-dynamic  response  can 
be  controlled  by  choice  of  time-stepping  method  and  pseudo-time  step.  The  reason  that 
this  seems  to  be  a  worthwhile  avenue  to  explore  is  suggested  by  recent  work  of  Y.  Re- 
nardy  and  M.  Renardy  'Sl.  They  found  that  with  a  certain  spatial  discretization  of  the 
linearized  operator  associated  with  the  equations  of  motion  of  a  Maxwell  fluid  in  a  shearing 
flow,  there  were  apparently  spurious  eigenvalues  extremely  close  to  the  right  half-plane, 
evidently  introduce  by  the  discretization.  If  this  were  also  a  consequence  of  finite  element 
discretization,  there  could  be  severe  consequences  for  iterative  methods  which  behave  like 
temporal  iteration  schemes.  It  is  hoped  that  by  controlling  the  time  step  and  parameters 
of  the  pseudo-dynamic  time-stepping  method,  the  damping  of  the  high  frequency  modes 
associated  with  any  spurious  eigenvalues  can  be  damped  to  produce  nearer  monotonic, 
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more  rapid  convergence  of  the  resulting  iterative  method.  There  is  no  worry  here  of  damp¬ 
ing  out  interesting  transient  behavior  —  the  transient  behavior  is  not  correct,  and  all  that 
it  is  required  is  that  it  be  damped  out  as  rapidly  as  possible. 

The  following  algorithm  is  based  on  Hughes,  Liu.  and  Brooks  pedictor-corrector 
algorithm  for  the  Navier-Stokes  equations  ;9  .  but  with  the  possibility  of  more  fully  implicit 
inner  iterations  at  each  time  step; 

Man^l  -  CVn^rl  ~  N(Va^l)  -  Q(v„_j)  = 


Q  /  B^adV~Cv 

Ju 

N  —  nonlinear  inertial  term.  excl.  time  deriv. 

Vn  -  (1  -  i)Ata„ 

-  J-^iM  ^  -  oAtQ(vi;i^) 

4  ^  A/N(vi;!  i)  -  ^  7AtF„.  1} 

an^l  =  (Vn-1  "  vlfJi)/7A/ 

J  r.  M  -  ('A/  ^■)  ^  (7  A/  ) 

d\  ■  ‘--•r'-t-  '  '  d\  ^  “J"' 


M  is  the  finite  element  “mass  matrix,”  C  the  Newtonian  viscous  and  penalty-pressure 
matrix,  and  F,^,.:.!  the  applied  force  vector  at  time  step  n  —  1.  B  is  the  usual  finite 
element  matrix  of  shape  function  derivatives  and  a  is  the  stress,  computed  in  as 

described  in  previous  sections:  without  the  superscript  of  inner  iteration  is  the  “fully 

converged”  result  of  inner  iteration  at  time-level  rt  -  1.  Choice  of  the  number  of  inner 
iterations  is  open,  so  that  Vn^i  could  result  from  just  one  correction  cycle,  or  many.  An 
important  aspect  of  eq.  (10)  is  found  in  those  terms  labelled  by  (Oor'C  ^  fully  implicit 
treatment  would  employ  exact  Jacobian  terms  here.  At  the  other  extreme  is  Hughes,  Liu, 
and  Brooks  method;  they  use  C  to  approximate  both  of  these  terms  and  do  only  one  inner 
iteration.  The  present  non-Newtonian  implementation  uses  C  initially,  updated  by  the 
inverse  Broyden  method  during  a  number  of  inner  iterations.  If  it  proves  to  be  effective, 
Newton  or  modified  Newton/Broyden  iterations  could  be  employed  in  the  inner  iterations. 
It  is  instructive  to  note  that  the  direct  steady-state  Broyden  algorithm  mentioned  earlier 
is  obtained  as  a  special  case  of  eq.  (10)  with  7  -  1  and  an  infinite  time  step. 


V.  SUMMARY.  A  pilot  numerical  method  for  the  computation  of  solution  to  memory 
fluid  flow  problems  has  been  described.  This  method  has  showm  that  such  computations 
are  feasible  but  extremely  costly.  More  reasonable  physical  assumptions  than  those  of 
isothermal,  incompressible  flow  in  a  fixed  domain  are  on  the  near  horizon  but  are  bound 
to  increase  the  computational  cost.  A  number  of  ways  of  improving  the  computational 
performance  of  the  algorithm  have  been  proposed  here  and  are  in  the  implementation 
stage.  These  improvements  will  go  together  to  make  what  the  author  refers  to  as  “a  fast 
algorithm  for  non-New  tonian  flow.”  It  is  hoped  that  this  fast  algorithm  can  transform  the 
method  described  here  from  pilot  code  to  useful  computational  tool  for  the  investigation 
of  problems  in  viscoelasticity  and  rheology. 
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ABSTRACT  A  modified  Riemann  problem  in  which  the  initial  and 

boundary  conditions  are  constants  are  considered  for  plane  waves  in  a 
half  space  occupied  by  a  simple  elastic  solid*.  The  governing 
quasi linear  differential  equations  are  a  system  of  hyperbolic 
conservation  laws  which  possesses  three  wave  speeds  c^  >  c^  >  c^* 

The  system  is  genuinely  nonlinear  with  respect  to  c^  and  c^  and 
linearly  degenerate  with  rspect  to  c^*  Thus  it  is  sufficient  to  study 
a  two-wave  speed  system  with  c^  and  C3-  Wave  curves  for  simple  waves 
and  shock  waves  are  used  to  construct  the  solution.  Second  order 

hyperelastic  materials  which  contain  four  material  constants  are 
considered  and  the  solution  in  the  form  of  wave  curves  are  obtained  for 
all  possible  combinations  of  initial  and  boundary  conditions.  With  a 
proper  nondimens ionalizat ion,  the  wave  curves  depend  only  on  one 
material  parameter  k.  The  solutions  are  thermodynamically  correct 
because  the  entropy  effects  does  not  come  into  picture  until  the  third 
order  in  stresses  are  included  in  the  constitutive  laws.  The  two-wave 
speed  system  have  one  umbilic  point  at  which  “  c^  and  hence  the 
system  is  not  totally  hyperbolic  (or  not  strictly  hyperbolic).  Several 
interesting  and  unexpected  results  are  obtained  due  to  the  existence  of 
the  umbilic  point.  In  one  example,  we  find  that  a  shock  wave  satisfies 
Lax  stability  condition  for  a  V^  shock  as  well  as  a  V^  shock.  In 
another,  a  shock  wave  which  involves  only  one  stress  component  does  not 
satisfy  Lax  stability  condition  for  either  V^  shock  or  V^  shock. 
However,  it  satisfies  Lax  stability  condition  if  we  consider  it  under 
the  context  of  one-wave  speed  system.  Finally  we  consider  the  effects 
on  the  solution  when  the  third  order  therms  are  included.  We  show  that 
although  the  entropy  affetcs  the  shock  wave  solution,  it  does  not  appear 
in  the  simple  wave  solution  until  the  fourth  order  terms  are  included. 
With  the  third  order  terms,  there  are  in  general  two  umbilic  points,  one 
of  which  may  be  an  umbilic  line* 

EXTENDED  SUMMARY  The  Riemann  problem  is  a  special  case  of  the 

Cauchy  problem  in  which  the  initial  value  is  a  constant  for  X  >  0  and  a 
different  constant  for  X  <  0.  Physically,  this  may  represent  the 
problem  of  a  fluid  in  a  tube  with  different  initial  pressures  on  both 
sides  of  a  diaphragm  located  at  X  “  0.  The  solution  to  the  Riemann 
problem  yields  the  fluid  motion  after  the  diaphragm  is  ruptured.  The 
problem  we  will  study  here  is  the  plane  waves  in  an  elastic  half  space 
X  ®  X^  ^0  which  is  prestressed  at  time  t  =  0.  For  t  >  0,  a  constant 
traction  is  applied  at  the  boundary  X  -  0.  This  is  a  ^modified”  Riemann 
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problem. 


The  governing  quasilinear  partial  differential  equations  for  the 
problem  are  a  system  of  hyperbolic  conservation  laws.  Extensive 
coverages  of  the  subject  can  be  found  in  Courant  and  Friedrichs  (19A8) , 
Jeffrey  (1976),  Lax  (1957,  1973)  and  Dafermos  (1983),  for  example. 

Inherited  with  the  solution  to  the  quasi^linear  system  is  the  weak 
solution  or  the  shock  wave  in  which  the  solution  is  discontinuous*  The 
appearance  of  shock  waves  in  the  solution  is  a  general  rule  rather  than 
an  exception* 

Plane  finite  amplitude  waves  in  simple  elastic  solids  have  been 
studied  by  many  investigators*  Chu  (1964)  and  Collins  (1966)  considered 
incompressible  materials,  Bland  (196Aa,b,  1965)  and  Davison  (1966) 

studied  compressible  elastic  solids  while  Howard  (1966)  investigated 
waves  in  transversely  isotropic  materials.  For  the  Riemann  problem,  the 
solution  in  general  consists  of  simple  waves  and/or  shock  waves  (Lax 
1957,  Liu  1975,  Smoller  1969b)*  The  Crux  of  the  problem  is  in  the 
determination  of  the  correct  sequence  of  simple  waves  and  shock  waves  to 
satisfy  the  initial  and  boundary  conditions.  Most  invetigators  used  a 
semi-inverse  approach,  i.e.  one  assumed  a  combination  of  simple  waves 
and  shock  waves  to  see  what  initial  and  boundary  conditions  were 
satisfied*  A  direct  approach  using  the  wave  curves  for  simple  waves  and 
shock  waves  was  employed  by  Li  and  Ting  (1983) .  The  idea  has  been  used 
earlier  by  Clifton  (1966)  and  Ting  and  Nan  (1969)  where  the  wave  curves 
are  called  "stress  paths".  However,  the  problems  studied  by  them  were 
for  elastic^^plastic  materials  in  which  the  elastic  part  was  linear  and 
hence  the  wave  curves  for  shock  waves  were  straight  lines.  In  the  paper 
by  Li  and  Ting  (1983),  the  second  order  hyperelastic  material  was 
studied  for  a  special  case  in  which  the  material  was  initially  stress 
free*  In  the  present  paper  we  extend  their  problem  to  include  arbitrary 
initial  condition*  We  also  consider  the  effects  on  the  solution  of 
including  the  third  order  terms  in  the  constitutive  equations. 

In  Chapter  II,  the  basic  equations  for  the  problem  are  developed. 
The  material  is  assumed  to  be  isotropic  simple  elastic  solids* 
Following  Li  and  Ting  (1983),  we  use  the  stresses  instead  of  the 
deformation  gradients  as  the  dependent  variables.  This  is  due  to  the 
fact  that  it  is  more  natural  to  prescribe  stress  rather  than  deformation 
gradiet  (or  strain)  as  the  initial  and  boundary  conditions*  With  the 
assumption  of  plane  waves  in  isotropic  simple  elastic  solids,  the 
deformation  gradients  are  functions  of  C7  arid  only,  where  a  and  r 
are,  respectively,  the  normal  and  shear  stress  on  a  X  =  *  constant 

plane*  The  angle  the  shear  stress  makes  with  respect  to  X^  axis  is 
denoted  by  d.  Since  the  basic  solutions  to  the  Riemann  problem  involve 
s impel  waves  and  shock  waves,  we  discuss  simple  wave  solutions  in 

Chapter  III  and  shock  wave  solutions  in  Chapter  IV* 

A  simple  wave  solution  represents  a  wave  fan  on  the  (X,t)  plane  in 
which  the  stresses  are  constants  along  the  straight  lines  passing 
through  the  origin.  As  the  slope  of  the  straight  lines  varies,  the 
stresses  trace  a  curve  in  the  stress  space  which  is  called  the  simple 
wave  curve.  There  are  three  simple  wave  curves  associated  with  the 
three  wave  speeds  c^  ^  c^  ^  c^.  Using  the  cylindrical  coordinate 
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system  Ca,T,0)  for  the  stress  space,  it  is  shown  that  the  simple  wave 
curves  associated  with  and  lie  on  a  radial  plane 

(6  «  constant)  and  hence  are  plane  polarized.  The  simple  wave  curve 
associated  with  is  a  circle  (a  =  constant  and  t  =  constant)  and 
hence  is  circularly  polarized.  These  results  agree  with  that  of  Bland 
(1965)  except  that  Bland  used  deformation  gradients  as  the  dependent 
variables^.  We  then  use  the  second  order  hyperelastic  materials  to  study 
the  geometry  of  simple  wave  curves.  There  are  four  material  constants 
but  the  simple  wave  curves  depend  only  on  one  non~dimensional  parameter 
k.  Depending  on  the  value  of  k,  there  are  four  different  simple  wave 
curves. 

In  Chapter  IV  we  study  the  shock  wave  solutions.  For  a  given  stress 
state  in  front  of  the  shock,  the  admissible -stress  state  behind  the 
shock  depends  on  the  shock  wave  speed  V.  As  V  varies,  the  stress  state 
behind  the  shock  traces  a  curve  in  stress  space  which  is  called  the 
shock  wave  curve.  There  are  three  possible  shock  wave  curves  associated 
with  the  three  shock  wave  speeds  and  V^.  Like  simple  wave 

curves,  shock  wave  curves  associated  with  and  are  plane 
polarized  and  the  one  associated  with  is  circularly  polarized.  The 
latter  is  identical  to  the  simple  wave  curve  associated  with  c^  since 
the  governing  differential  equations  are  linearly  degenerate  with 
respect  to  (Lax  1957) . 

A  shock  wave  solution  satisfying  the  Rankine-Hugoniot  jump  conditions 
is  not  necessarily  admissible  unless  it  also  satisfies  the  Lax  stability 
condition  (Lax  1957) 


Ci(B)  <  Vi(B,A)  <  Ci(A)  , 


(1.1) 


where  B  and  A  denote,  respectively,  the  stress  state  in  front  of  (or 
Before)  and  behind  (or  After)  the  shock  wave.  Without  Lax  stability 
condition  the  solution  may  not  be  unique.  A  simple  example  of  non-^ 
uniqueness  is  provided  by  Hopf’s  equation  (Hopf  1950,  Witham  1974).  For 
a  large  amplitude  shock,  Lax  stability  condition  is  necessary  but  not 
sufficient.  A  more  discriminating  condition  was  proposed  by  Liu  (1974). 
Let  P  be  any  point  between  A  and  B  on  the  shock  curve.  The  Liu 
admissibility  condition  reads 


Vi(B,P)  <  V.(B,A) 


(1.2a) 


Also,  if  we  define  the  "reversed”  shock  wave  curve  the  locus  of  the 
stress  state  B  in  front  of  the  shock  for  a  fixed  stress  state  A  behind 
the  shock,  and  if  Q  is  any  point  on  the  reversed  shock  wave  curve 
between  B  and  A,  then  the  Liu  admissibility  also  stipulates  that 


Vi(B,A)  S  VjCQ.A) 


(1.2b) 
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In  general  the  shock  wave  curve  and  the  reversed  shock  wave 
different.  In  any  case  we  have 


curve  are 


V^CB.B)  =  C£(B)  and  VjCa.A)  =  cjCa)  ,  (1.3) 

and  (1-1)  is  a  special  case  of  (l*2a,b)e  As  in  Chapter  III,  we  use  the 
second  order  hyperelastic  material  to  study  the  geometry  of  shock  wave 
curves*  Again,  the  shock  wave  curves  are  found  to  depend  only  on  one 
parameter  k. 

By  neglecting  the  entropy  in  the  second  order  materials  the  analysis 
is  thermodynamically  correct  because  the  effect  of  entropy  on  the  shock 
discontinuity  is  in  the  third  order  of  strain  (or  stress)  and,  with  the 
adiabatic  approximation,  the  entropy  is  a  constant  through  a  simple 
wave-  We  therefore  study  the  solutions  to  the  Riemann  problem  for 
second  order  hyperelastic  materials  in  Chapter  V,  This  chapter 
represents  the  main  work  of  this  paper.  We  consider  all  possible 
combinations  of  initial  value  and  boundary  value,  both  of  which  are 
constants*  In  view  of  the  fact  that  the  c^  simple  wave  (which  is  also 
the  Vj  shock  wave)  is  linearly  degenerate,  there  is  no  lose  of 
generality  in  ignoring  the  c^  simple  wave.  Therefore  all  we  have  to 
consider  are  simple  wave  curves  associated  with  c^  and  C3  and  shock 
wave  curves  associated  with  V,  and  V  .  These  curves  lie  on  the 
(<7,t)  plane.  Consequently,  the  problem  is  reduced  to  a  two^^wave  speed 
system  instead  of  a  three-wave  speed  system.  If  ^  C3,  the  system 
is  said  to  be  "totally  hyperbolic"  or  "strictly  hyperbolic"  (Courant  and 
Hilbert  1962,  Lax  1957).  In  our  case  th^  system  is  totally  hyperbolic 
everywhere  except  at  the  point  (o',r)  =  {(j  ,0)  at  which  c^  =  C3. 

This  point  is  called  the  umbilic  point  (Shearer  et  al,  1985). 

The  existence  of  the  umbilic  point  leads  to  the  following  interesting 
and  unexpected  results,  some  of  which  have  also  been  found  by  Schaefer 
and  Shearer  (1985)  in  the  problem  of  oil  recovery. 

i)  For  the  hyperbolic  materials,  the  simple  wave  curves  associated 
with  different  wave  speeds  are  orthogonal  to  each  other.  Hence  there 
are  two  wave  curves  through  each  point  on  the  (a,r)  plane.  At  the 
umbilic  point,  however,  the  simple  wave  curves  may  not  be  orthogonal  to 
each  other  (see  Figs -1-6).  Moreover,  there  may  be  infinitely  many  wave 
curves  passing  through  the  umbilic  point  (Fig. 2.  See  also  Ting  1973)). 

ii)  For  the  reduced  two-wave • speed  system,  there  are  in  general  two 
wave  fans  in  the  solution.  The  wave  fan  may  be  a  simple  wave,  a  shock 
wave  or  a  composite  wave  in  which  a  shock  wave  is  in  contact  with  a 
simple  wave  of  the  same  family  (Fig. 8).  However,  when  the  wave  curve 
passes  through  the  umbilic  point,  one  may  have  three  or  even  four  wave 
fans*  (See  wave  pattern  10  and  11  of  Fig. 8.) 

iii)  From  a  given  point  other  than  the  umbilic  point  on  the  (cr,T) 
plane,  one  can  draw  two  simple  wave  curves  and  two  shock  wave  curves. 
They  are  orthogonal  at  the  starting  point.  As  one  follows  one  of  the 
wave  curves,  the  curve  may  intersect  with  the  other  wave  curve  starting 
from  the  same  point  (Fig.9b)*  This  causes  the  solution  to  depend 
dis continuously  on  the  boundary  condition*  (See  also  Li  and  Ting  1983.) 

iv)  For  a  large  amplitude  shock,  we  have  an  example  in  which  the 
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shock  wave  speed  satisfies  Lax  stability  condition  (1,1)  for  both  i  =  1 
and  i  =  3  (see  (5.6c)).  Thus  the  shock  has  the  double  role  of  being  a 
shock  as  well  as  a  shock. 

v)  In  another  example  we  have  a  situation  in  which  Lax  stability 
condition  (1.1)  is  not  satisfied  for  either  i  =  1  or  i  =  3  (see  (5.9c)). 
We  have  therefore  no  solutions  which  satisfy  Lax  stability  condition* 
However,  for  the  example  concerned  the  shock  wave  curve  is  along  the 
cT^axis  and  hence  involves  only  one  stress  component.  Considering  the 
wave  motion  with  one  stress  component  one  would  obtain  a  one~wave  speed 
system.  Under  the  one^wave  speed  system  the  shock  concerned  satisfies 

the  Lax  stability  condition.  We  have  therefore  a  paradox  in  which  a 

shock  is  stable  under  the  one-wave  speed  system  but  unstable  under  a 
two-wave  speed  system. 

In  the  last  chapter  we  consider  the  effects  of  including  the  third 
order  terms  in  the  constitutive  equations.  As  pointed  out  by  Bland 
(1969),  we  can  no  longer  ignore  the  entropy  since  the  effect  of  entropy 
in  a  shock  wave  is  of  third  order  in  strain  and  hence  in  stress.  For  a 
special  third  order  hyperelastic  material,  we  show  that  the  entropy  jump 
across  the  shock  is  positive  for  the  solutions  obtained  in  Chapter  V*  A 
slightly  more  general  third  order  material  is  then  used  to  study  the 

effects  of  third  order  terms  on  the  geometry  of  shock  wave  curves. 

Finally,  we  consider  the  general  third  order  hyperelastic  materials.  We 
show  that  there  may  be  as  many  as  three  umbilic  points  on  the  (cr.r) 
plane,  one  of  which  may  be  an  umbilic  line.  This  is  interesting  since 
it  is  not  common  to  have  an  umbilic  line. 
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ABSTRACT.  Boundary  value  problems  involving  a  conformal  transformation  of 
simply  connected  regions  when  formulated  by  two  commonly  used  methods,  one 
using  a  vector  approach  and  the  other,  a  more  general  tensor  approach,  reveal 
an  apparent  disparity,  the  resolution  of  which  leads  to  a  condition  which  must 
be  satisfied  by  the  Jacobian. 

As  illustrations,  (1)  the  flow  of  a  viscous  incompressible  fluid  in  an 
eccentric  annulus  and,  (2)  the  torsion  of  a  hollow  shaft  with  a  similar  geo¬ 
metry,  are  considered  and  the  conditions  to  be  satisfied  by  the  Jacobia.is  are 
obtained. 

I.  INTRODUCTION.  Invariant  formulations  of  physical  laws  and  their 
mathematical  modeling  require  the  use  of  vector  and/or  tensor  approaches.  By 
a  vector  approach  is  meant,  in  the  present  context,  the  use  of  traditional 
vector  calculus  operations,  typical  under  a  Euclidean-space  setting,  without 
the  explicit  use  of  covariant  differentiation  and  the  Christoffel  symbols, 
which  characteristically  belong  to  the  domain  of  tensor  calculus.  By  a  tensor 
approach  is  meant  the  use  of  operations  of  covariant  and  intrinsic  differentia¬ 
tions  which  explicitly  involve  the  Christoffel  symbols  defined  in  a  Riemannian 
space.  These  Christoffel  symbols  are  geometry-dependent  quantities  which,  in 
general,  are  not  tensors  themselves  except  under  the  affine  group  of  transfor¬ 
mations.  Also,  these  symbols  identically  vanish  in  the  case  of  Cartesian 
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tensors  since  the  latter  are  always  defined  relative  to  a  Cartesian  frame  of 
reference.  As  a  consequence  of  the  use  of  the  Riemannian  space  in  the  tensor 
approach,  as  opposed  to  that  of  the  Euclidean  space,  the  tensor  derivatives 
which  occur  in  the  gradient,  divergence  and  curl  operations  are  involved  with 
a  more  elaborate  geometry  acquired  through  the  Christoffel  symbols  than  the 
geometry  involved  in  the  various  vector  operations.  It  is  the  purpose  here  to 
examine  more  closely  the  consequences  of  this  geometric  structure  of  the 
tensor  ap()roach,  which  produces  extra  terms.  However,  t!u,so  extra  terms  must 
vanish  since  the  vector  and  tensor  approaches  must  both  produce  the  same 
outcome.  It  is  shown  in  this  paper  that  when  the  above  techniques  are  applied 
to  a  physical  problem  involving  a  conformal  mapping  of  a  simply  connected 
region  occupied  by  a  material,  there  v.'ould  result  a  useful  geometric  condition 
on  the  Jacobian  of  the  conformal  transformation.  In  physical  problems 
involving  materials  with  their  complicated  geometry  requiring  the  use  of 
curvilinear  coordinate  systems,  the  tensor  approach  results  in  numerous  terms 
whose  occurrence  poses  considerable  complexity.  The  above-mentioned 
constraining  condition  on  the  Jacobian  helps  readily  identify  those  terms 
which  must  vanish,  thus  resulting  in  a  substantial  reduction  in  the  complexity 
of  the  problem. 

We  present  some  applications  of  the  above  concept  to  both  fluid  and  solid 
mechanics. 
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II.  miUlULATlOd 


Consider  a  one-to-ono  transfoniiation  defined  by 

z  -  i’(c), 

z^  “  +  Ix^,  C  "  +  iX^,  i  “  /“i  t  ("-1^ 

wl'ero 

X  ,  X*"  e  IR  ,  k  -  1,2,3, 

are  triplets  of  real  numbers  represent! rlij  two  roctunyular  Cartesian  systems. 

L(?t  the  first  of  tlie  evjuations  in  (2,1)  represent  a  conformal  transformation 
from  the  plane  to  tlie  plane  throu'jh  the  analytic 

function  F  defined  over  a  simply  connected  open  set  D  such  that  its  derivative 
F'(c).  ^  0  in  D,  The  Jacobian  of  Lite  above  conformal  transformation  aiul  . 

its  inverse  arc,  rcspective-ly , 

J  "  9(X^,X^)/0(x^,x^) 

-  (DXVOx^XDX^/Ox^)  -  (0XVax2)(DX2/0x^)  >  0  ,  (2.2 

-  D(x^x2)/a(Xl,X2)  «  (DxVdx1)0>:2/DX2)  -  (OxVoX^)  (Ox^/DXl ) 

•*  (Dx^/DX^)^  +  (Dx^/DX^)^  >  0  ,  (2,2 
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where  the  last  step  in  (2.3)  follows  from  the  previous  one  by  the  use  of  the 
Cauchy-Riemann  equations 


3xV3X^  =  3x^/3X^  ,  3x^/3X2  = .  (2.4) 

The  lino  element  ds  in  the  x^x^  space  is  related  to  that  in  the 

x^a2  space  by  tiie  fol  lowing. 

ds^  =  (dx^)^  +  (dx^)^ 

=  j"’t(dXi)^  +  (dx2)2  =  pj^^dx'^dx^  ,  (2.5) 


where  in  the  last  step  in  (2.5),  summation  over  repeated  indices  is  implied 

and  we  shall  use  this  summation  convention  in  the  sequel.  Thus  from  (2.5),  w 

kl 

find  that  the  metric  tensor  and  its  reciprocal  g  are  given  by 

9n  -  -rr  ■  -  »22  "  ^2  '  5'''  '  Skt  ■  "■  ■ 

g  9  (2.6) 

Next  introduce  over  the  region  D,  a  continuously  differentiable,  symme¬ 
tric,  second  rank,  linear  tensor-valued  function  t  of  another  tensor-valued 
function  d  with  similar  properties,  which  in  turn  is  a  linear  function  of  the 
gradient  of  a  continuously  differentiable  vector  field  v  as  follows. 
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t(Xl,x2) 


(2.7) 


=  (a^trd)l  +  2a^d 
d(xl,x2)  =  (l/2)[(Vv)  +  (Vv)"^]  (2.8) 

where  i  aidcntity  tensor,  and  ,  arc  scalars  independent  of 

the  space  variables  x^  and  x^  ,  and  the  superscript  T  over  a 
tensor  denotes  its  transpose. 

Eqs.  (2,7)  and  (2.0)  con  alternately  be  expressed  in  the  component  nota¬ 
tion  for  tlic  yiven  orthoyonal  coordinate  system  as 


.kt 

d  ^ 


where  the  semi-colon  followed  by  an  index  represents  the  covariant  differen¬ 
tiation  with  respect  to  space  variables  and  an  underscore  below  an  index 
indicates  the  suspension  of  summation  over  that  index.  For  example,  in  the 
xU^  space. 


(a^trd)g‘'*'  +  2a^d'"*' 


m  kZ  , 

a  V  g  +  Za^d  p 
o  ;  m'" 


1 


,  mk  ni 

cl  g  g 

mn 


(1/2)  (v'^;j^g--  +  v*’.,^g--) 


(2.9) 


(2.10) 
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1,  2, 


•  •  • 


(2.11) 


k 

V 


+ 


irifk  , 

V  (  0^ 

mil 


d 


12 


(l/2)(v’.2g^^ 


)  . 


(2.12) 


The  expression  in  (2.11)  denotes  the  Christoffel  symbol  of  the 
second  kind  [SYNGC  and  SCIIILD  (1949)]  which  is  related  to  that  of  the  first 
kind,  [lm,n],  and  is  defined  below, 

=  (l/2)g'^"(^i?il 
ax'” 


9q 


■  mn 


9g 


ax 


9A' 


(2.13) 


The  last  step  on  the  right-hand  side  of  (2.9))  follows  immediately  by  taking 
the  trace  of  (2.8)  and  noting  that 


trd  =  V-v  =  v"^_.  (2.14) 

One  of  the  principal  objectives  of  the  present  Investigation  is  to  compute 
the  divergence  of  t  ,  by  two  independent  methods,  vectorial  and  tensorial, 
in  order  to  derive  the  condition  to  be  satisfied  by  the  Jacobian  of  the  confor¬ 
mal  mapping  (2.1). 
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Vim:  I  or  ApprOiu:li 


Tlio  (JivcryonCG  of  (2.7)  is  ijiven  by 


V*t  “  Q  V(y*^)l  +  2ajV’d 


(2.15) 


where  is  obtained  from  (2,0): 


v»d 


(l/2)[V(V*v)  +  V^v] 


(1/2) [2V(V*v)  -  7xVxv]  , 


so  that  (2.15)  becomes 


V.t  B  a^V(V-v)I  +  a^[2V(Y-v)  -  Vx^xv] 


(2.16) 


(2.17) 


Ihe  pliysical  components  of  V't  in  the  yivcn  orthogonal  coordinate  system 
of  the-  conformal  mapping  in  (2,1)  can  bo  computed  from  those  of 
V(v-y)  and  vxVxy  by  the  usc  of  formulas  given  by  UllITllAM  (1903).  Those 
computations  lead  to  tlio  following  physical  components,  denoted  by  enclosing 
the  index  in  parentheses. 


df, 

[v(v*v)](i)  «  vf— j-  , 

DX' 

3f  5  d? 

[7xVxv](i)  s  /J  [<5.( — j)  -  6.( — , )]  , 

'  sx^  ‘  ax' 


(2.18) 


f,  °  J[^-(v(l)//T}  +  ^v(2)/yJ}]  , 

'  OX  ax 


r,  =  j[^{v(2)//j}  -  ^-ivd)//!]]  , 

ax'  ax 


(2.19) 
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wlipre  <s!  is  the  Kroncckcr  delta  and  i=l,2  .  The  quantities  y(0  and  v(2) 
are  titc  physical  components  of  v  correspondiny  to  X  and  X.'  .  Thus,  we 
obtain 


3f, 


[V*t],,(i)  =  a  /r( — j-)  +  a,v^[2( — .) 


V 


3X 


ax 


1  ^*^2 

6  .  (-4) 

ax 


I 


+ 


af 


ax 


2 

r 


-)] 


(2.20) 


where  the  suffix  v  on  the  left-hand  side  denotes  the  value  of  the  physical 
component  ofV»t -obtained  by  using  the  vector  approach  andi=l,2  . 


Tensor  A|)p roach 


Tfie  divergence  of  t  in  the  tensor  ap[)roach  is  obtained  by  using  Lfie 
following  well  known  expression,  see  SYNGE  and  SCHILD  (1949) 


[V-t]' 


k«, 


at 


kJ!, 


;k 


ax 


(2.21) 


P 

where  (V*t]  denotes  the  1  th  contravariant  component  of  7»t  .  Substituting 
(2.10)  into  (2,9)  and  the  resulting  expression  into  (2.21),  and  tlien  applying 
the  well  known  Ricci's  theorem  of  tensor  analysis  [SYNGE  and  SCllILb  (1949)], 


for  all  k,  and  m,  one  obtains 


(2.22) 


==  a  v'(r 


a:i: 


af]  ....kt 

■  r  9““  +  Of]  [ 

ax^  ax 


’I 

J-  Ik/ 
nik 


(2.23) 
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where  and  i'"^  are  all  obtainable  from  the  general  expression 


I 


i  Jj 

V. .gii 

9  J 


+ 


i2.2k) 


It  may  readily  be  identified  that  the  bracketed  expression  in  the  coefficient 
of  in  (2.23)  is  the  covariant  derivative  and  divergence  of  the  tensor  with 
respect  to  ,  that  is. 


I 


k£ 


;k 


+ 


_m£rk 

^mk 


> 


(2.25) 


For  practical  applications,  one  needs  to  obtain  the  physical  components 
from  the  vector  and  tensor  components.  The  latter  are  related  to  the  former 
by  [see,  TRUESOELL  (1953,  1954)  and  ERICKSEN  (I960)] 


v(k)/.^ 


(2.26) 


=  t 


(k)(£)'’'«kkyu 


kJl 


=  t 


(k)C£)  ^kk^tt 


(k)(£) 


where  the  indices  enclosed  in  parentheses,  as  before,  represent  physical 
components.  Thus,  in  tensor  approach,  the  physical  components  of  v*t  are 
obtained  from  (2.23)  leading  to 


[v*t]^(i)  = 


9f. 


9X 


1  ;k'^  M 


i.k  =  1 ,2 


(2.27) 
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where  the  suffix  T  in  the  left-hand  side  of  (2.27)  denotes  the  value  of  the 
physical  component  of  V*t  obtained  by  using  the  tensor  approach. 


Since  the  values  of  the  physical  components  of  V*t  obtained  by  using  the 
vector  and  tensor  approaches  are  required  to  be  identical,  we  must  have  from 
(2.20)  and  (2.27) 


[^•tj^(i) 


a  ^  -  /r(2^ 

ax' 


19^2  2^'^? 


(2.28) 


“  0  , 


for  the  coefficients  of  a  in  the  two  terms  on  the  left-hand  side  of  (2.23) 

0  . 

I'eing  identical  automatically  cancel  out.  The  coefficient  of  in  (2.28) 
can  be  evaluated  by  using  (2.19)  and  (2.24)-(2.26) .  After  somewhat  lengthy 
but  straightforward  algebra,  one  readily  finds  that  (2.28)  finally  reduces  to 


where 


v2j  -  (l/j)[(3j/3Xl)^  +  (3J/3X2)^]  =  0 


V2 


(2.29) 


(2.30) 


is  the  familiar  Laplacian  operator  of  the  plane  rectangular  Cartesian  system 

(x\x^). 

It  can  now  be  readily  verified  that  the  Jacobian  of  a  transformation  which 
is  given  to  be  conformal  readily  satisfies  the  condition  (2.29).  Conversely, 
by  retracing  the  steps,  one  readily  finds  that  if  the  Jacobian  of  an  arbitrary 
plane  coordinate  transformation  satisfies  (2.29)  then  the  latter  must  bo  a 
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conformal  one.  Thus  eq,  (2.29)  is  an  invariant  geometric  property  of  the 
Jacobian  since  it  is  independent  of  the  choice  of  the  conformal  map  F  in  (2.1). 

III.  APPLICATIONS 

In  this  section,  two  physical  examples  are  presented,  one  from  fluid  mecha¬ 
nics  and  another  from  solid  mechanics,  in  order  to  illustrate  the  above  pro¬ 
perty  of  the  Jacobian  of  the  conformal  transformations  involved. 

Ilia.  Conformal  mapping  of  the  flow  between  eccentric  rotating  cylinders 

The  flow  between  eccentric  rotating  cylinders  is  most  conveniently  studied 
by  introducing  a  modified  bipolar  coordinate  system,  see  WANNICR  (1950),  WOOD 
(1957),  and  DIPRIMA  and  STUART  (1972).  Here  the  main  interest  consists  only 
in  the  conformal  mapping  in  terms  of  the  modified  bipolar  coordinate  system 
and  in  the  derivation  of  the  condition  which  must  be  satisfied  by  the  Jacobian 
of  the  transformation.  The  momentum  equations  for  a  laminar  flow  in  terms  of 
the  stress  field  are  also  formulated. 

Consider  the  flow  between  two  infinitely  long  circular  cylinders  of  radii 
a  and  b  (>a)  with  centers  set  a  distance  ae  apart.  In  order  to  insure  that 
the  cylinders  do  not  touch  we  require  ae<b-a,  which  can  be  written 

0  »  e  <  1  ,  (3.1) 

where 

c  =  e/i  ,  6  *=  (b  -  a)/a  .  (3.2) 


679 


The  parameter  e  is  the  eccentricity  and  the  parameter  6  ,  called  the 

clearance  ratio*  is  a  measure  of  the  ratio  of  the  moan  clearance  between  the 
cylinders  to  the  radius  of  the  inner  cylinder.  Lot  the  polar  coordinate 
system  have  its  oriyin  at  the  axis  of  the  inner  cylinder  with  the  ray  e  =  o 
through  the  axis  of  the  outer  cylinder. 


Following  Wood  in  further  detail  we  introduce  modified  bipolar  coordinates 
by  means  of  the  conformal  transformation 

z  “  a(c  +  y)/(1  +  Yt)  I  z  "  X  +  iy  =  re'®  »  C  "  C  +  'n  =  pe'*^  ,  (3.3) 

where  y  is  a  real  constant  given  by 

Y  -  {J/2e){-[2  +  6(1  -  e)]  +  [{2  +  6(1  -  e)}^  -  (3.^») 


The  coordinate  curves  p  =  constant  are  circles;  in  particular  the  inner  and 
outer  cylinders  are  given  by  p  =  i  and  p  =  g  ,  respectively,  where 


]  +  6  +  e6  ~  y 
1  -  (1  +  6)y  -  eY6 


(3.5) 


An  advantage  of  tlie  modified  bipolar  coordinate  system  as  compared  to  the 
usual  bipolar  coordinate  system  is  that  in  the  limit  e  o  the  P,f 
coordinate  system  reduces  to  the  r,e  coordinate  system  except  for  a  scale 
factor  a. 


The  Jacobian  J*  of  the  transformation  (3.3)  is  given  by 

J  “  (1  +  2ypcos<|>  +  Y^P^)V(1  -  Y^)^  ,  !yI  1  •  (3.(*) 
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The  eleiiiont  of  arc  length  in  two  dimensions  is 


ds 


2  2  2 
dr  +  c  dO 


(3.7) 


The  stress  components  for  an  incompressible  viscous  fluid  in  terms  of  the 
modified  bipolar  coordinate  system  p,(|,  are  obtained  from  the  linear  con¬ 
stitutive  relation  having  the  form  (2.7)  and  (2.8)  wittj  a  trd  =  -h  and 

1  O  "" 

Oj  “  u,.  wliere  t  •»  stress  tensor,  d  =  deformation  rate-  tensor,  y  =  velocity 
vector,  p  =  fluid  pressure  and  u  =  viscosity  coefficient  (assumed  to  be  inde¬ 
pendent  of  spatial  coordinates). 


In  the  vector  approach,  Y’t  takes  the  form  (2.17).  (low  let  v  and  v, 

p  <1) 

be  the  velocity  components  in  the  p  and  (|)  directions,  respectively. 

Then  tlie  component  forms  ofV*t  become,  in  the  p  and  ((i  directions, 

*1# 


(v-t],,(p)  ^  ’ 


(3.8) 


/r 


where 
9 


D  ^  P  4-  -1-r  _  ( _ ^)  1 

Ir -iiTl-cr— ) J  +  7T  a,,.  LJ..  y  » 


1  aF  'p  H  '  /j' 


2j!^  4.  • 

’■J  _  .  f\|  — ^ 


p  3p  3(|)  p  3p 


(3-9) 


g  .  [J-  +  |.  . 

^3  9p  ^ap  8p  9<l>  /j 


and 


lV-t]^(l-) 


p/r 


ap  3(|) 


(3. 1C 
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wliere 


M  “  3'p 


aij. 


TV  9v  1  TT 

2J _ P_  J_  D J  p  2_  0^ _ ^ 

p2*  jjj,  *  p  Op  D<|i  ^  p  5i|i  Op 


(3.11) 


h  .  11- [J- (.?..(_!!ii)  +  i-(-iin 

3  P  3*  ‘ap  '3p'  *  3+'  • 

In  the  tensor  approach,  the  stress  constitutive  relation  takes  the  form 
(2,9)  and  (2,10)  with  ^  =  y  ,  The  expressions 


for  [v-t]^(p) 


and  [v*t]^(0)  are  obtained  from  (2.27).  Thus, 


[V*t]^(p)  ® 


.  ^  ^  ^  (T  ^  ^  ) 

a  Op  1  2 


(3^12) 


where 


3v 

2~  (/r^ ) 

Op  op 

P,  Op 

I  T  T  3v 

_  j _ ^ _ p_ 

/j  Op  Op 

-  ""p  /W, 

2J^ 

-  2  OJ  ^''(|) 

+  %  , 

p/T  3(()  9p 

2p2/r  0^ 

- 

2pvG’  0p04( 

H 

P  3F“  * 

L  i-(/j  ^i) 

p  0(j)^  Op  ' 

2v  2v 

-4/r  +  — 

^  p/j 


Apj/r  Op 


OJ 


2p/r  0(1)  op 

/.p2  j/j  ^^ 


(3.13) 


(3.1A) 


%  O^J 
2p^/j  0-1(2 
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anti  a  corrospontlituj  expression  for  [v*t]^(<|))  which  is  obtained  in  like  manner. 
Now,  since  the  vector  and  tensor  approaches  must  produce  identical  results  for 
v*L  ,  the  difference  between  (3.8)  and  (3,12)  must  vanish  as  also  the  differ¬ 
ence  between  (3.10)  and  the  corresponding  expression  for [V't]^(ij))  must  vanish 
for  all  possible  arbitrary  velocity  fields.  Tlius 


[v-t]^(p)  - 

yields  the  condition 

a^j  ^  1  .  +  I 

p  3p  ^ 


tv-t]v(p)  “  0 


1  f 

J  ^'•3p^ 


P 


W.2 

3(J)^ 


0 


(3.15) 


(3.16) 


Similarly,  it  can  be  readily  verified  that 

-  [v-t]y(<|.)  =  0  ,  (3.17) 

yields  the  same  identical  condition  (3.16),  This  establishes  the  constraint 
on  the  Jacobian  of  the  conformal  mapping  given  in  (3.3). 


For  purposes  of  future  applications,  it  is  useful  to  formulate  here  the 
momentum  equations  in  terms  of  the  stress  divergence  components  employing  the 
modified  bipolar  coordinates  p,<|)  : 


3v 


P 


3t 


+ 


a  p3p 


ap  tdtp 


1  3  //j\ 

—  — ^  ( — )  V  V 

a  3(j)  'p  '|)  p 


S.  i-(jSv2 

a  c)p  p  <i> 


(3.18) 


aa 


8t 


t  ^ 


PP 


Bp 


^~)(t 
2J  3p  PP 


I  A 

If  Pi 

p 


jL'Il  id'!  1 

J  Scl)'-*  ’ 
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r.iirf. k:o«;  hoilvj  lonj  cyl  indcTS  which  are  not  concentric.  Suppoco  that 


e  nz 


material  of  the  cylinders  is  made  up  of  a  linear  homogeneous  isotropic  elastic 
solid  characterized  by  Hooke's  law  with  constant  lame  coefficients.  Tlie  [Drisin 
is  twisted  by  couples  applied  at  the  ends  of  the  prism.  It  is  well  knov;n,  see 
TIMOSIICNKO  and  GOODIEU  (197U),  that  the  dofynnation  of  the  twisted  shaft 
consists  (1)  of  rotations  of  cross  sections  of  the  shaft  and  (2)  of  warping  of 
the  cross  sections  assumed  to  be  the  same  for  all  cross  sections.  In  order  to 
delineate  the  warping  of  the  cross  sections,  a  function  known  as  the  torsion 
function  is  introduced.  The  determination  of  this  function  is  an  integral 
part  of  the  problem.  The  solution  of  this  problem  is  most  conveniently 
effected  by  introducing  a  conformal  transformation  in  the  plane  of  a  cross 
section  of  the  prism 


z  =>  c'tan^C  F  z  =  X  +iy  ,  C  °  E  +  in  ,  (3-22) 

whore  2c  is  the  distance  betv/een  the  poles  of  the  bipolar  coordinate  system 
(;  ,  n  induced  by  (3.22).  In  the  present  study,  we  are  only  interested  in 

tlio  pruporty  of  the  Oacobian  of  the  above  conformal  transformation. 

The  Jacobian  of  the  transformation  is  readily  found  using  (2.2). 

j  =  ^(cosC  +  coshn)^  >  0  .  (3-23) 

0 

The  curves  n  =  constant  are  found  to  be  coaxial  circles  with  equations 

2  2  2  2 
X  +  (y  -  c'cothn)  “  c  cosech  n  ,  (3.2^0 

wliile  the  curves  K  “  constant  are  circles  given  by 

(x  +  c-cotO^  +  =  c^cosec^C  F  (3-25) 
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and  the  cross  section  of  the  prism  is  bounded,  say,  by  the  eccentric  circles 
n  =  and  n  =  . 

The  stress-strain  relations  and  other  pertinent  formulas  of  linear  elasti¬ 
city  are  all  analogous  to  the  formulas  (2.15)-(2.17)  where  t  is  the 
stress  tensor,  d  is  replaced  by  the  strain  tensor  e  »  the  vector y  is  replaced 
by  tlje  displacement  vector  u  and  and  are  respectively,  replaced  by  the 

Lame  coefficients  X  and  y  .  - 

e  e 

Following  the  general  method  giver,  in  section  2  one  obtains  the  expres¬ 
sions  for  stress  divergence  using  the  vector  and  tensor  approaches.  Thus 


e3n  ^  /r 


[v-t]y(n)  = 


+  » 

e95  95  /j  9n  /j 


(3.26) 


(3.27) 


where  and  are  the  components  of  the  displacement  vector  relative  to  the 
bipolar  coordinate  system  C  and  n  . 
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In  the  tensor  approach 


[v-tljCO 


'gn  1  M  f  1 

1  J  3n  Sn;'’  ’ 


(3.28) 


[v-t]^(n) 


dt 


+  - 1  ) 
2J  9n^^gg  nn^ 


+  _  1 M  .  1 

9C  J  95  Cn-^ 


(3.29) 


3u  , 

t  =  /rrx  ( _ L  +  _ n  -  i_ ..  1 

^55  95  9n  2J  ^59g'  2J  “naTT^ 


+  2y  (  — &•  -  —  u  — )  ] 

e'  95  2J  nan'^^  ’ 


(3-30) 


/?„  (_!i  ^  Lii ,  +  i_u  H.) 

9n  2J  95  n  95  2J  5  an^  ’ 


(3.31) 


_  au_  9u 

e  95  an 


2J  ^^595 


J_  ^  V 

2J  ^n  an 


9u 


1  gjv, 

2J  ^5  95^^ 


(3.32) 


Substituting  (3.30)  -  (3.32)  into  (3.28),  one  obtains 


[V*t]_(5)  =  A^E,  +  y  E, 

I  el  e  ^ 


(3.33) 
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where 


F,  =  J 


+  J 


3^u 
_ n 

aean 


2  e  ac^ 


1 

2  ^nscan 


1  ClT 

I  3J  ri 


1  3J.  3J 


2  3n  3?  5j  35  3ti  ^ti 


+  lH!!a 

2  35  3n 


(3.34) 


F.  =  2J 


3^ 


1  ii.  n 

2  3ri  35 


1  a^J  1  8J  9J 

2  353n  35  3n  “n 


-  TT  U 


3  ST 

.  3,  ^ _ n 

2  35  an 


1  /ajA2 
2J  ^535 


3 


3Jx2 


^  /C»J  \ 

57  ""5^ 


+  j 


+  J 


acan 


n  .  1 

+  —  u 


32j 


2  “5  an^ 


(3.35) 


Subtracting  (3.26)  from  (3.33),  one  obtains 


[v*t]^(5)  -  [v-t]y(5)  =  0  , 


(3.36) 


which  yields  for  all  possible  arbritary  choices  of  u 


(3.17) 


In  .like  manner,  the  difference -expression, 

[v*t].|.(Ti)  -  tv*t]y(n)  =  0  ,  (3.38) 

also  leads  to  the  same  identical  condition  as  in  (3.37)  on  the  Jacobian. 
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IV.  DISCUSSION 


From  the  foregoing  analysis,  the  following  observations  and  conclusions 
can  be  effected  regarding  the  properties  of  the  Jacobian  of  the  conformal 
transformations. 

(1)  The  condition  satisfied  by  the  Jacobian  given  by  (2.29)  is  entirely 
independent  of  the  mapping  function  of  the  conformal  transformation  and  hence 
is  invariant  with  respect  to  arbitrary  choices  of  conformal  maps. 

(2)  Also,  the  condition  on  J  is  independent  of  all  physical  fields  and 
parameters  within  the  class  of  continuously  differentiable,  symmetric  tensor¬ 
valued  functions. 

Hence  the  constraint  on  the  Jacobian  is  an  invariant  geometric  condition 
which  arises  purely  as  a  geometric  property  of  the  conformal  transformation 
and  its  Jacobian. 

(3)  Only  the  coefficient  ofa  in  (2.28)  vanishes  identically  without 

o 

constraining  J  in  any  manner,  but  the  coefficient  ofa^  involves  the  condition 
on  J.  The  phenomenon  can  be  accounted  for  by  observing  that  the  coefficient 

ofa  involves  the  first  invariant,  trd  ,  and  owing  to  this  invariance,  there 

0 

occurs  no  change  in  the  geometry  of  the  region.  On  the  other  hand,  the  co¬ 
efficient  ofoi^  is  indeed  associated  with  a  geometry  change  since  it  leads  to  a 
constraint  on  the  Jacobian  of  the  transformation  causing  such  a  change.  If 
for  example,  v  is  the  velocity  vector  of  a  fluid  particle  in  a  given  region,  of 

flow,  the  coefficient  v(v»v)  of  a  does  not  involve  any  distortional 

-  0 
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change  since  it  represents  a  dilatation  rate.  On  the  other  hand,  the  coeffi¬ 
cient  of  does  represent  a  change  of  shape  which  means  a  change  in 

geometry  implying  a  constraining  condition  on  the  Jacobian  of  the  transfor¬ 
mation  causing  the  change  of  shape. 

Thus,  the  vector  and  tensor  approaches  yield  identical  results  for  the 
divergence  of  a  second  rank  tensor  t  of  the  type  described  earlier  if  and  only 
if  the  invariant  geometric  condition  (2.29)  on  the  Jacobian  of  the  conformal 
mappiiig  in  (2,1)  is  satisfied. 
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DUE  TO  FLUID  LOAD 

EDWARD  W.  ROSS,  JR. 
Mathematician 

US  Army  Natick  Research  &  Development  Center 
Natick,  Massachusetts  01760-5017 


Abstract 

This  paper  analyzes  the  behavior  of  a  tarpaulin,  suspended  at  its  ends  when  a 
puddle  of  water  accumulates  on  it  during  a  rainstorm.  The  tarpaulin  is  taken  as 
a  wide,  thin  sheet  of  linearly  elastic  material,  without  bending  strength, 
initially  horizontal  and  unstressed.  It  undergoes  large,  plane  deformation, 
caused  by  the  weight  of  the  puddle,  and  is  idealized  as  a  string.  The  solution 
is  found  in  closed  form  except  for  one  boundary  condition  that  has  to  be 
satisfied  by  trial  and  error  and  involving  a  numerical  integration.  The  deformed, 
sheet  has  the  shape  of  a  sine  function  beneath  the  load.  Asymptotic  formulas  are 
derived  in  the  small  deformation  limit,  but  most  of  these  are  sufficiently 
accurate  to  be  useful  for  practical  ranges  of  the  parameters  when  the  deforma¬ 
tions  are  large. 

1.  Introduction 

Tents,  tarpaulins  and  other  forms  of  thin,  flexible  sheets  are  much  used  by 
the  military  for  the  shelter  of  troops  and  equipment.  The  theoretical  behavior 
of  such  structures  under  conditions  commonly  met  in  the  field,  such  as  rain,  snow 
and  wind,  has  not  been  studied  extensively.  The  present  paper  is  an  investiga¬ 
tion  of  one  such  situation,  namely,  when  rain  fails  on  a  wide  flat,  horizontal 
sheet,  fixed  at  the  edges.  Under  these  conditions  the  sheet  sags  and  water 
accumulates  in  a  puddle  near  the  center.  We  analyze  a  simple  version  of  this 
process,  i.e.  one-dimensional  continuum  (a  string)  under  a  liquid,  gravity  load 
that  varies  along  the  string  in  a  manner  consistent  with  the  condition  that  the 
liquid  surface  be  horizontal.  The  deformations  are  not  required  to  be  small,  but 
a  linearly  elastic  constitutive  relation  is  assumed.  Bending  and  dynamic  effects 
are  omitted. 

The  behavior  of  a  flexible  string  has  been  analyzed  by  many  writers,  going 
back  to  the  times  of  Bernoulli  and  Love.  Various  aspects  of  the  problem  have 
been  studied  recently  by  Antman  [1]  and  Pugsley  [2]  among  others.  Much  effort 
has  gone  into  the  development  of  finite-element  programs  for  the  numerical  solu¬ 
tion  of  nonlinear  string  problems,  see,  for  example.  Fried  [3],  Huddleston  [4] 
and  Peyrot  and  Goulois  [5].  The  author  is  aware  of  only  one  paper  dealing  with  a 
problem  similar  to  the  present  one,  Malcolm  and  Glockner  [6],  and  that  is  re¬ 
stricted  to  small  deformation. 
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2.  Formulation 


A  wide  sheet  is  idealized  as  a  one-dimensional  continuum,  a  string,  of  length 
2L,  initially  straight  and  horizontal,  with  its  left  end  at  x  =  y  =  0  and  mid¬ 
point  at  X  =  L,  y  =  0,  see  Figure  1.  The  sheet  is  deformed  symmetrically  about 
the  mid-point  by  a  puddle  of  liquid  with  a  horizontal  surface  and  variable  depth. 
Variables  X,  Y  are  the  deformed  coordinates  of  the  point  on  the  string  that  is 
initially  at  (x,  y). 

The  kinematics  of  the  deformed  string  are  specified  in  terms  of  e  ,  where 
tan  0  =  dY/dX  =  slope  of  string  (1) 

If  dS  is  the  deformed  length  of  a  small  string  element  whose  initial  length  was 
dx,  then 

dS/dx  =  1  +  ^  =ndX/dx)2  +  (dY/dx)^]^^^  (2) 

where  e  is  the  strain.  Also 

dX/dx  =  (1  +  e  )  COS0  ,  dY/dx  =  (1  +  e  )  sine  (3) 

We  do  not  assume  that  Ie|  «  l. 

The  constitutive  law  for  linear  elasticity  is  used, 

T  =  tension  in  string  =  T^^e  (4) 

although  it  is  conceivable  that  some  progress  could  be  made  with  a  more  general 
elastic  law. 


Equilibrium  of  a  string  element  in  the  X  and  Y  directions  implies 


dT/dx  =  -F  (dS/dx)  sine 
y 

T  de/dx  =  -Fy(dS/dx)  cose 


(5) 


where  Fy  dS  is  the  vertical  upward  force  of  the  liquid  on  the  deformed  string- 


element  of  length  dS.  We  assume  that  the  water  surface  is  a  distance  below  the 
initial  position  of  the  string  (see  Figure  2),  and  the  wet  portion  of  the  string 
has  initial  coordinates 

AL  »  X  =  L. 

If  u  is  the  density  of  the  liquid,  and  U  is  the  unit  step  function,  we  find 
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with  the  aid  of  (3) 

F  dS/dx  =  v(Y  +  6  )  (dX/dx)  U  (x  -  AL) 

=  v(Y  +  A  )  (1  +  e  '  coseu  (x  -  AL) 

The  system  of  equations  is  then 

dT/dx  =  -  jj  (Y  +  A  )  (1  +  e  )  sin  6  cos  e  U  (x  -  AL) 

Tde  /dx  =  -  (Y  +  A  )  (1  +  e  )  cos^e  u  (x  -  AL) 
dY/dx  =  (1  +e  )  sine 
dX/dx  =  (1  +  e)  cose 

These  have  to  be  solved  with  the  boundary  conditions 
X  =  Y=  0atx  =  0 
9=0,  X  =  Latx  =  L 
Notice  that 

Y  =  -  A  at  X  =  AL 
because  of  the  definitions. 

The  equations  may  be  put  in  dimensionless  form  by  means  of  the 
transformations 

Z  «  x/L,  5  =»X/L,  n  =  Y/L,  5  =A/l,  e  =  T/X^ 


P  “  L(p/T  ) 


1/2 


after  which  they  may  be  written 

de  /dz  =-p2  (  n  +  5)  (1  +  e-  )  sine  cose  U(z  -  A) 

de  /dz  =  -p^e"^  (  r)+  6  )  (1  +  e  )  co^e  U(z  -  A) 

dn  /dz  =  (1  +  e  )  sine 

dC  /dz  =  (1  +  e  )  cose 

with  boundary  conditions 

C  =n=0atz  =  0 
0=0,  c  =  1  at  z  =  1 


(6) 


(7) 


(8) 


(9) 


(10) 

(11) 

(12) 

(13) 

(14) 

(15) 
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and  the  intermediate  condition 

n  =  -6  at  z  =  A  (16) 

The  forcing  function  in  this  problem  is  the  total  load  of  liquid,  W,  in 
the  region  0=  zSl,  which  is  given  by 

W  =  -T  sinel^.Q 
or,  in  non-dimensional  terms 

W/T  =  \  =  -c(O)sin0(O)  (17) 

0 

3.  Solution 

An  al most-complete  solution  to  the  problem  in  closed  form  is  obtainable  by 
elementary  means.  The  solution  in  the  dry  (i.e.  unwetted)  region  0^z  =  A  can 
be  wri  tten^  by^  i  nspecti  on ,  and  the  solution  in  Aaz  =  l  which  joins  smoothly  with 
that  in  0  =  z  =  A  can  be  found  as  a  function  of  e  (which  is  monotone  in  A  =  z  =  l) 
by  dividing  pairs  of  the  equations. 

In  the  dry  region  the  solution  is  easily  seen  to  be 

e  =  e(0)  -  , 

5  =  (l+e^)zcose^  , 


0=0  =  0(0) 

0 

n  =  (T+c  )zsin0 

0  0 


(18) 


Then  at  z  =  A  we  have 


C  =  .  e  .  .  5  =  (l+£„)Acose^ 

n  =  -6  =  (1+e  )Asin0 

0  0 


(19) 


The  solution  for  z=A  is  found  as  follows, 
we  get 


-1 

e 


de/d0  =  tan0 


Dividing  Equation  (10)  by  (11), 


which  has  the  solution 


e  =  E  cose  sece 
0  0 


(20) 
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This  shows  that  e  =  e  ,  as  expected.  Next,  if  we  divide  (11)  by  (12)  we  see 
that  ° 

edo/dn  =  -p^(M+6)ctnOcos0 

using  (20),  this  can  be  rewritten  as 

.I 

sec^etane  +  Q  p2(ri+6)dn/d0  =  0 
and  has  the  solution 

n  =  -6  -  Q^/^p‘^H(e,0^)  (21) 

where 


Q  =  e  COSO  (22) 

^  0  o 

H  =  -  tane  (1  -  tan^e/tan^e  S  0  (23) 

0  0 

If  (13)  is  divided  by  (11),  the  resulting  equation  can  be  written 

de/de  =  -Q^^^p'^H'^sec^e 

which  has  the  solution 

1 17  -1 

C  =  A(Q  +  cos0^)  +  q'"^p  i|;(0,0^)  (24) 

ii)(0,0  )  =  tt/Z  -  sin"^ (tane/tan0  )  (25) 

0  0 

At  this  point  we  have  obtained  solutions  for  e,n  and?  as  functions  of  e 

which  satisfy  the  junction  conditions  at  z  =  A.  It  still  remains  to  find  the 

relation  between  e  and  z.  To  do  this,  we  must  solve  (11),  which  leads  to 


I/O 

~  k  +  Q  '^p  (Q  +  cost)  r,H(t)]  sec^tdt 


Integrating  by  parts  and  using  ,p(d  ,0  )  =  0  >  we  obtain 

0  0 


z  =  A  +  q''^^p''[(Q  +  cos9)''i|i(9,9  )  -  (Q  +  cost)’^*(t,9^)sintdtJ 

t-o  (27) 

0 

The  integrals  in  Equations  (26)  and  (27)  cannot  be  evaluated  exactly  but  may  be 
found  by  numerical  integration.  In  doing  so,  the  form  (26)  is  inconvenient 
because  of  the  singularity  in  the  integrand  arising  from  the  zero  of  H  at  0=0^ 

The  singularity  is  merely  in  thfe  derivative  of  the  integrand  of  (27)  and  that 
integral  was  evaluated  successfully  by  the  IMSL  subroutine  DCADRE. 
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The  deformed  shape  of  the  string  can  be  obtained  in  closed  form  by  elimina¬ 
ting  0  between  (21)  and  (24),  which  leads  to  the  relation 

n  *  -6  +  Q^'^^p'nan0^sin[pQ“^'^^{5  -  A(Q  +  ccse<.,)}]  (28) 

Thus  the  wetted  part  of  the  sheet  is  shaped  like  a  portion  of  a  sine  curve. 

The  parameters  Qo  and  ^^(or,  equivalently,  Q)  have  yet  to  be  evaluated. 
For  this  purpose  we  use  the  boundary  conditions  (15)  in  the  form  5  =  z  =  1  at 
0=0.  These  lead  to  the  equations 

1/7  “1 

A(Q  +  cos0^)  +  Q  P  ii/2  -1=0  (29) 

A  +  Q^^^p'hd  +  ^  (30) 

where 


I 


X 


-2 

(Q  +  cost)  ^^(t,e  )sintdt 

t=e 

0 


(31) 


Taking  A  and  p  as  given  quantities,  these  equations  were  solved  for  Q^using  the 
IMSL  version  of  the  Brent  algorithm.  If  0^  is  assumed,  Equation  (29)  is  merely 
quadratic  in  Q  and  can  be  solved  easily.  Then  the  function  on  the  left  of  (30) 
is  evaluated  for  these  values  of  and  Q,  and  the  Brent  algorithm  generates 
successive  values  of  until  Equation  (30)  is  satisfied  with  sufficient 
accuracy. 

When  the  solution  for  0^  has  been  obtained,  the  quantities 

e  =  Qseco 
0  0 

6  =  -(1  +  e  )Asin0 

0  o 

X  =  -E  sine 

o  o 

n(l)  =  -6  +  Q^'^^p"Uan0^ 
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can  be  found.  The  relation  between  z  and  e  is  expressed  by  Equation  (27),  and 
Equations  (21)  and  (24)  then  give  the  deformed  positions  of  points  parametri¬ 
cally  in  terms  of  their  initial  locations. 

In  applying  these  formulas  we  shall  usually  regards  as  the  load-intensity  and  p 
as  a  parameter  characteristic  of  the  material -geometrical  situation.  The  forms 
of  the  solutions  are  such  that  for  most  variables  of  interest  the  dependence  on  x 
and  p  can  only  be  obtained  implicitly.  However,  it  is  informative  to  examine  the 
small -deflection  limit,  for  which  explicit  formulas  can  be  found.  This  deriva¬ 
tion  requires  some  care  and  is  given  in  the  Appendix.  The  results  are  as  X  ->■  n 


-  A  ~  2'^^®[Tr/(2p)]X^^^  '  1.399p‘^^''^ 

(32) 

-  -e  ~  -n(l)  '  (2X)^''^  '  1.260X^/^ 

0 

(33) 

'  Q  '  2’^'^^X^^^  =  0.794X^'^" 

(34) 
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4.  Results  and  Discussion 

Although  the  relation  between  initial  and  deformed  positions  of  particles  on 
the  sheet  is  of  some  interest,  we  are  primarily  concerned  with  finding  the 
effects  of  the  parameters  x  and  p  on  the  quantities  A  (which  specifies  the  half- 
length  of  the  puddle),  9  (maximum  slope  angle),  e  (maximum  strain  or  dimension- 
less  stress),  n(l)  (dimensionless  central  deflection)  and  6  (dimensionless 
displacement  of  the  puddle  surface).  Typical  values  for  the  physical  parameters 
in  anticipated  applications  of  these  results  are 

L  =  3  meters,  y  =  1,000  Kg/meter^,  Tq  =  360  newton/mm 
Since  all  calculations  are  done  per  unit  sheet-width  perpendicular  to  the  plane  . 
of  motion,  a  typical  value  for  p  is 

p  =  L(y/T^)’^^  :  1/2  . 

First,  for  illustrative  purposes  we  display  in  Figure  3  two  examples  of  the 

deformed  shape  (when  p  =  1/2)  of  the  sheet,  for  A  =  0.4,  X  =  .01256,  and  A=  0.8, 
-3 

X  ®  .387  X  10  .  The  actual  weights  of  water  on  these  sheets  are  452  and  13.9 
kilograms  per  meter  of  width,  respectively.  The  more  highly  stressed  of  the 

two  cases  (A  =  0.4)  suffers  a  maximum  strain  e  =  0.039.  It  is  thought  that  this 

0 

sheet-material  tears  at  about  e  =0.1,  consequently  failure  is  not  expected  in 
either  case.  Comparing  the  initial  and  deformed  positions  of  points  in  these 
graphs,  we  observe  that  the  x-displacements  are  much  smaller  than  the  y-displace- 
ments. 

Figures  4  through  8  show  plots  of  ti(i)  ,  A  and  6  as  functions  of  x 

for  p  =  .2,  .5  and  .8.  These  graphs  imply  two  main  conclusions.  The  first. 
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perhaps  rather  simple,  is  that  for  any  p  there  is  a  maximum  total  liquid  weight 
that  the  deformed  sheet  can  carry.  For,  when  the  entire  sheet  is  wetted,  A  =  6 
=  0,  any  additional  liquid  will  simply  overflow  at  the  ends  of  the  sheet.  These 
maxima  can  be  seen  in  Figure  7  as  the  values  of  \  for  which  A  =  0.  The  relation¬ 
ship  between  this  maximum,  or  fully-wetted,  load,  ,  and  p  is  exhibited  in 
Figure  9.  This  shows  that  increases  with  p  and,  for  example,  x  =  x^  =  0.3  when 
P  =  0.8. 

The  second  main  point,  visible  in  Figures  4,  5,  and  6,  is  that  p  does  not 
have  much  effect  on  the  relations  between  x  and  and  n(l)  in  the  range 

.2  =  p  =  0.8.  This  agrees  with  the  asymptotic  behaviors  of  £^.9^  and  nd)  for 
small  deflections.  Equations  (33)  and  (34),  which  show  no  effect  of  p  .  Observe, 
however,  that  A  and  6  are  affected  by  p  ..  Indeed,  Equation  (32)  and  Figure  7 
indicate  that  p  affects  the  relation  between  A  and  x  for  all  relevant  x  values, 
i.e.  for  0  “  X  2  x^.  Equation  (33)  asserts  that  the  relation  between  5  and  x.  is 
not  influenced  by  p  when  x  ->  o,,  but  the  effect  of  p  is  felt  when  x  =  1  x  10  , 
according  to  Figure  8. 

Systematic  comparisons  between  the  asymptotic  formulas  (32),  (33)  and  (34) 
and  Figures  4  through  8  show  that  the  asymptotic  formulas  for  e^,e^,  nd)  and 
A  are  almost  good  enough  to  be  used  throughout  the  range  p=o.8.  The  asymptotic 
formulas  are  of  course  most  accurate  for  small  %  ,  but  the  graphs  show  that  their 
errors  are  not  too  bad  even  when  x  is  fairly  large,  say  x  *  0(10  ^). 

In  particular,  the  estimate 
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is  quite  good  even  near  x  =  1  and  can  almost  be  regarded  as  a  universal  formula 
for  p  =0.8.  The  asymptotic  predictions  of  ,  n(l)  and  A  are  less  accurate 
but  still  usable  as  rough  estimates  for  x  =  o(lo'’)  .  The  simple  formula  (33) 

for  <5.  is  accurate  only  for  very  small  x  and  completely  misses  the  dependence  of  <5 
on  P- 

We  present  next  an  elementary,  somewhat  contrived,  application  of  these 
results,  namely  to  estimate  the  rain  conditions  under  which  a  sheet  with  T  =  36 
newton/cm,  width  =  6  meters,  i.e,  L  =  3  meters,  will  break.  We  assume  that  the 
sheet  tears  when  e  =  0.1.  The  asymptotic  formulas  tell  us  that  tearing  occurs 
when 

X  =  =  Ml 

w  =  XT^  =  1  ,609kg/ni 

Hence  the  volume  of  water  per  meter  of  sheet  width  is  1.609  tn^  and  so  tearing 
occurs  when  the  average  depth  of  water  on  the  sheet  is  1.609/3  =  .536  m  (about  21 
inches). 

If  rain  falls  at  the  rate  of  .025  m  (an  inch)  per  hour,  the  sheet  will  tear 

in  about  21  hours.  Just  before  the  sheet  tears  we  have  from  (33)  that 
-nd)  “  -e  *  1 .260(.04i<7)’'^^  =  .^^7  . 

O 

Since  p  =  0.5,  Equation  (32)  gives 

A  =  1  -  1.399  (.0447)’''^  /.5  =  .01. 

The  relative  accuracy  of  this  estimate  for  A  is  rather  poor  since  the  true  value 
is  A  =  .09.  Despite  this,  the  approximation  is  qualitatively  correct  in 
asserting  that  the  sheet  is  almost  completely  engulfed  when  it  tears.  The  depth 
of  water  is  about  1.3  meters  (4  feet)  in  the  center  of  the  puddle. 
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5.  Conclusions 

We  conclude  that  this  solution  provides  quite  a  bit  of  information  about  the 
behavior  of  wide  flexible  sheets  during  a  rainstorm.  The  solution  may  also  be 
useful  in  assessing  the  quality  of  finite-element  codes  that  purport  to  solve 
geometrically  nonlinear  elastic  problems.  There  is  not  at  present  a  wide  range 
of  such  solutions  for  comparison  with  finite-element  results.  The  solution  given 
here  is  not  ideal  for  that  purpose  (some  error  is  inherent  in  the  numerical  in¬ 
tegration  and  solution  by  the  Brent  algorithm)  but  is  probably  good  enough  to 
provide  a  meaningful  standard. 

The  example  suggests  that,  if  the  assumed  parameters  are  typical  of  those  in 
common  use,  tearing  will  occur  only  in  a  phenomenally  heavy  or  prolonged  rain¬ 
storm.  However,  lesser  storms  may  cause  unacceptably  large  deflections  at  the 
center  of  the  sheet. 

With  minor  modifications  this  solution  can  be  employed  to  study  the  more 
realistic  case  where  the  edges  of  the  sheet  are  attached  to  the  ends  of  elastic 
columns.  The  solution  in  that  case  is  more  complicated  than  here,  but  not  as 
much  so  as  might  be  expected,  for  the  following  reason.  In  the  present  problem 
the  sheet  has  fixed  ends,  and  the  only  inextensional  solution  is  trivial.  When 
the  ends  are  allowed  to  move,  a  nontrivial  inextensional  solution  exists  and  is 
not  much  more  difficult  to  find  than  the  present  one. 


O 
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Appendix 

The  abjective  is  to  find  the  asytotic  behavior  as  A  -»■  I  and  0^  0  of  the 

solutions  to 

A(Q  +  cos0^)  +  ^ti/2  '■  1  =•  0  (A.l) 

A  +  Oy^p"’[(l  +  (i)’’ir/2  -  1^1  -  1  =0  (A. 2) 

where  « 

—2 

I  =  'l<(t,9  )((l  +  cost)  sintdt  (A. 3) 

^  In  ^ 


We 


H’(t,0  )  “  (ti/2)  -  sin  *(tant/tan0  ) 

'0  0 

start  by  estimating  Since  6^  is  small 
Q.  +  cost  ~  1  +  (i  -t^/2  r  1  +  Q 


(A.M 

and  negative  so  is  t,  hence 

(A.  5) 


sint  =  tant  ^  t  ,  sine  *  6 

0  0 


I  :  (i+a)' 


[ (n/2)  -  sin  *  (t/e^)]tdt 


and 


=  (l+Q)~^["(TTe^/M  “  [  tsin  '  (t/0  )dt 
0  Jq  0 

0 

fP  -1 

1  tsin  (t/0  )dt  “  -Tf0^/8  . 

Jfl  0  O’ 


Thus  we  find 


I  =  -  (l+Qj'^TreVS  .  (A. 6) 

If  the  t^/2  term  were  retained  in  equation  (A. 5),  a  negligible  correction  to  this 
estimation  would  result. 

Now  we  define 

Y  B  1  “  A  >  0,  =  a6  >  0,  a  =•  2p/Tr  (A.?) 
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and  use  the  approximation 

COS0  :  1  -  0^/2 
0  o 

in  equation  (A.l)  to  obtain 

02/2  ;  [(e  -  y)/(1  -  y)]  +  oi2g2 

It  is  evident  now  that  as  0^  0  and  A  ->  1 ,  i  .e.  y  0,  we  must  have  B  0. 

We  def i ne 

<!>  =  B  -  Y  (A. 8) 

where  ](►!  «  B,y,  and  so  can  write 

02/2  :  [(|)/(1  -  y)]  +  (A. 9) 

We  do  not  yet  know  the  relative  magnitudes  of  (J)  and  ^2  Yq  clarify  this  we 
observe  that  (A. 2)  can  be  written  with  the  aid  of  (A, 6)  and  (A. 7)  as 

'•y(J  +  a3)^  +  B(1  +  olB)  +  302/1^  =  o  . 

0 

Using  equations  (A. 8)  and  (A. 9)  leads  eventually  to 

0 

(A. 10) 


The  other  quantities  of  interest  are  calculated  now,  as  follows: 

e  ®  Q/cos6  -  a2Y2 
o  0 

X  =  -e  sin0  ~  2'^^a^Y^ 
o  0 

6  =  -A(l  +  e  )sin6  -  2^^^aY 
0  0 

q(l)  =  -6  +  Q’^%'’tan0  ~ 

O  ' 


2(<|)  ^  +  a^6^)  +  B<|>/(l  -  y)  - 

which  is  solved  for  ((»  to  obtain 

(|)  -  a2Y^ ,  3  -  y(1  ot2Y2) 

a  -  a2Y2(l  +  2a2Y2) 

Combining  this  with  (A. 9),  the  estimate 

@2/2  ■>.  a2Y2  +  a2Y^/(l  -  y)  '  a2Y2 


i  s  found . 
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When  these  formulas  are  re-expressed  in  terms  of  X  ,  Equations  (32),  (33)  and 
(34)  are  obtained. 


Figure  Captions 


Figure  1: 
Figure  2: 
Figure  3: 

Figure  4: 

Figure  5; 

Figure  6: 

Figure  7: 

Figure  8: 

Figure  9: 


Sketch  of  the  Deformed  String 

Load  on  an  Element  of  the  Deformed  String 

Deformed  Shapes  of  the  String  for  p  =  1/2.  Initial  and  Deformed 
Positions  for  A  =  0.4  (o)  and  A  =  0.8.  (x). 

Dependence  of  Strain,  ,  on  Total  Weight, x  ,  for  p  =  0.2,  (o),  0.5 
(x)  and  0.8  ( a). 

Dependence  of  Angle,  9^  ,  on  Total  Weight,  X  ,  for  p  =  0.2,  (o),  0.5 
(x)  and  0.8  (A) . 

Dependence  of  Central  Deflection,  n,(l)»  on  Total  Weight,  x  ,  for  p 
=  0.2,  (o),  0.5  (x)  and  0.8  (a). 

Dependence  of  Puddle  Edge  Position,  A,  on  Total  Weight,  X  ,  for  p  = 
0.2  (o),  0.5  (x)  and  0.8  (a). 

Dependence  of  Puddle-Surface  Displacement,  6,  ,  on  Total  Weight,  x  , 
for  P  =  0.2  (o),  0.5  (x)  and  0.8  (a). 

Dependence  of  maximum  Load  x^on  Parameter  p  . 
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Figure  1:  Sketch  of  the  Deformed  String 
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Figure  4;  Dependence  of  Strain,  e  ,  on  Total  Height, 
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Figure  5:  Dependence  of  Angle,  9  ,  on  Total  Weight,  \  .  for 
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Figure  7:  Dependence  of  Puddle 
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Figure  8;  Dependence  of  Puddl 


RELATIVISTIC  WAVE  EQUATIONS  FOR  SOLIDS 
AND  LOW  TEMPERATURE  QUANTUM  SYSTEMS 

Richard  A.  Weiss 
Environmental  Systems  Division 
U,  S*  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180-0631 


ABSTRACT.  The  local  scale  invariance  of  relativistic  thermodynamics  is 
established  and  a  set  of  coupled  relativistic  wave  equations  is  developed  that 
describe  the  propagation  of  small  amplitude  waves  in  solids  and  low  temperature 
interacting  Fermi  and  Bose  systems,  A  first  order  expansion  calculation  is 
done  in  order  to  obtain  the  wave  equations  from  a  basic  set  of  material  equa¬ 
tions.  The  wave  equations  determine  the  relativistic  energy  density  and 
Griinelsen  parameter  for  small  amplitude  mechanical  or  electromagnetic  waves - 
In  turn,  the  wave  amplitude  and  phase  velocity  are  obtained  from  the  energy 
density  and  Grlineisen  parameter  of  the  waves.  At  low  pressures  the  wave 
amplitudes  are  determined  to  be  not  greatly  different  from  those  predicted  by 
a  nonrelativistic  calculation,  but  at  high  pressures,  such  as  those  expected 
to  occur  under  nuclear  blast  loading  or  in  stellar  compact  objects,  the  cal¬ 
culated  relativistic  wave  amplitudes  can  be  considerably  larger  than  the 
corresponding  nonrelativistic  predictions. 

1 .  INTRODUCTION .  Local  gauge  invariance  has  become  a  powerful  tool  in 
modern  physics. ^  It  has  unified  the  electromagnetic  force  with  the  weak 
nuclear  force,  and  possibly  also  with  the  strong  nuclear  force,  into  a  single 
entity  described  by  local  non-Abelian  gauge  invariance  of  the  fields.^  The 
requirement  of  local  gauge  invariance  necessitates  the  introduction  of  new 
fields  which  obey  sets  of  coupled  differential  equations  which  lead  to  uni¬ 
fication  of  the  fields  in  a  natural  way.  The  gauge  transformations  corre¬ 
spond  to  generalized  rotations  of  the  fields  in  the  manner  where 

cji(x)  depends  on  space  and  time  for  local  gauge  invariance. 

A  theory  of  relativistic  thermodynamics  has  been  developed  for  solids 
and  quantum  liquids  which  is  based  on  a  set  of  coupled  differential  equations 
for  the  zero  temperature  values  of  the  Grlineisen  parameter  and  pressure,^ 

These  coupled  equations  are  derived  from  a  relativistic  trace  equation  that 
has  been  gauged  (scaled)  by  the  introduction  of  the  Grlineisen  parameter.  The 
fact  that  an  additional  field  (the  Grlineisen  parameter)  has  to  be  considered 
in  order  to  calculate  the  relativistic  pressure  suggests  an  invariance  of  the 
system  similar  to  local  gauge  invariance*  Electromagnetism  and  the  electro- 
weak  interactions  are  examples  of  fields  that  are  described  by  local  gauge 
theories  in  which  the  gauging  Is  accomplished  by  the  Introduction  of  addi¬ 
tional  fields*  The  Invariance  of  relativistic  thermodynamics  refers  to 
different  values  of  pressure  and  energy  density;  and,  in  analogy  to  a  local 
gauge  transformation,  is  manifested  through  a  real  exponential  corre¬ 

sponding  to  changes  of  the  local  scale  of  the  pressure  and  energy  density  of 
a  system,^  This  paper  shows  that  relativistic  thermodynamics  is  scale  in¬ 
variant,  and  that  the  local  symmetry  group  for  relativistic  thermodynamics 
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Is  the  unlmodular  group  of  scale  transformations 

The  trace  equation  of  relativistic  thermodjmamlcs  is  written  as- 


P^V 


(1) 


where  U  =  relativistic  internal  energy,  P  =  relativistic  pressure,  T  =  absolute 
temperature,  V  =  volume  per  mole  of  substance,  and  U®  and  P^  =  corresponding 
nonrelativistic  internal  energy  and  pressure.  Throughout  this  paper  the 
index  "a”  will  refer  to  nonrelativistic  calculations.  It  has  been  shown  that 
for  a  physical  system  described  by  U  =  Uq  +  tJjTJ+  and  P  =  Pq  +  PjTJ  +  ••• 

(such  as  the  high  temperature  Mie-Gruneisen  state  equation  with  j  =  1  and  the 
Debye  state  equation  with  j  =  4)  the  trace  equation  (1)  is  equivalent  to  the 
following  set  of  coupled  differential  equations^ 


3V 


2^ 

dV^ 


dP 


+  3(3  +  Y„>v  ^  +  [3(y^  +  K 


jy  P  dy 

^  -  K  dV 

o  o 


o  o  ' 


where  the  internal  energy  coefficients  are  given  by^ 


(2) 


(3) 


(4) 


and  where  Pq,  Kq  and  Yq  “  zero  temperature  values  of  the  relativistic  pressure, 
incompressibility  (  -VdPjj/dV)  and  Grlineisen  parameter  respectively,  and  P®, 
and  Yq  are  the  corresponding  nonrelativistic  values  of  these  quantities.  Eqs. 
(2)  and  (3)  are  a  set  of  coupled  nonlinear  differential  equations  for  Pq  and 
Yq.  The  zero  temperature  value  of  the  Griineisen  parameter  is  the  T  =  0  limit 
of  the  Griineisen  parameter  defined  as^ 


(5) 


which  for  a  temperature  dependent  Debye  temperature  becomes,®*^ 
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(6) 


where  Cy  -  heat  capacity  and  0j)  =  Debye  temperature.  The  Debye  temperature 
is  generally  a  function  of  V  and  T-  Equation  (6)  gives  the  general  nonrel¬ 
ativist  ic  expression  for  the  Grlineisen  parameter. 

Wave  propagation  in  matter  is  complicated  by  the  fact  that  matter  is  a 
thermodynamic  system  which  is  described  by  physical  quantities  such  as  the 
heat  capacity  and  Griineisen  parameter. ®  Further  complications  arise  be¬ 
cause  matter  is  often  prestressed  as  for  instance  by  gravitation  in  a  star 
or  planet.^  Various  assumptions  are  used  to  calculate  the  phase  velocity  and 
wave  amplitude  in  terms  of  the  thermodynamic  state  equation  of  a  material 
medium,  but  no  completely  general  procedure  exists  for  calculating  these 
quantities •  This  paper  develops  a  relativistic  calculation  of  the  phase 
velocity  and  amplitude  for  mechanical  waves  in  thermodynamic  media. 

Specifically ,  this  paper  develops  a  set  of  coupled  first  order  relativ¬ 
istic  equations  that  govern  small  amplitude  wave  propagation  in  thermodynamic 
systems  such  as  solids  and  low  temperature  Fermi  and  Bose  liquids.  These 
equations  determine  the  relativistic  energy  density  and  Grlineisen  parameter 
for  mechanical  or  electromagnetic  waves.  The  phase  velocity  and  wave  ampli¬ 
tude  can  be  obtained  from  the  energy  density  and  Grlineisen  parameter  for  the 
waves,  and  are  expressed  in  terms  of  the  material  parameters  of  the  thermo¬ 
dynamic  ground  state  of  the  material  system. 

The  wave  equations  are  developed  from  a  small  amplitude  perturbation 
expansion  of  a  set  of  relativistic  material  equations.  This  insures  that 
the  derived  wave  equations  are  intimately  connected  to  the  material  parameters 
of  the  thermodynamic  medium.  A  complicated  set  of  nonlinear  wave  equations 
are  derived  whose  complete  solution  requires  numerical  computer  techniques* 
however,  an  order  of  magnitude  approximate  solution  is  given  for  the  case  of 
elastic  waves  in  solids.  The  procedure  followed  in  this  paper  is  to  first 
review  local  scale  invariance,  then  the  relativistic  ground  state  calcula¬ 
tion,  and  then  the  relativistic  wave  equations  that  govern  excitations  in  a 
thermal  medium. 


2.  LOCAL  SCALE  INVARIANCE,  The  fact  that  an  additional  field  YqCV) 
must  be  introduced  in  order  to  calculate  the  zero  temperature  pressure  Pq(V) 
as  in  Eqs,  (2)  and  (3)  suggests  that  the  relativistic  trace  equation  (1)  may 
be  invariant  under  a  local  scale  transformation  of  the  form  exp  [-“<t>(V,T)]  * 
The  scale  invariance  of  Eq.  (1)  will  now  be  demonstrated.  To  do  this  the 
following  elementary  thermodynamic  relationships  are  used^ 
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where  Kt  =  -V(3P/3V)j  =  isothermal  incompressibility.  Using  Eqs.  (7)  and 
(8)  allows  Eq.  (1)  to  be  written  as 


where 

T(3P/3T) 


(9) 


(10) 


T(3P®/3T)^ 
(P^  -  K^) 


(11) 


Note  that  the  Griineisen  parameter  y  defined  in  equation  (5)  is  not  independ¬ 
ent  of  the  quantity  b  defined  by  equation  (10).  Eq*  (9)  can  be  rewritten  in 
a  more  S5anmetrical  form  by  writing  U  =  EV,  where  E  =  energy  density,  with 
the  result 


-  b  +  T  -  bV  gyjt  -  3^1  +  v  +  V  gy  "yT  g-p^P  (12) 

.(l-b*.H^-b%^)E» 

Eq.  (12)  can  be  written  as 

(Hj  -  Wj)E  -  3(H2  -  W2)P  -  (hJ  -  W®)E®  (13) 

where 
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H  =  V  —  -  yT 
2  3V  ^  3T 


W  =  b  -  1 


(15) 

(16) 


W2  =  -Y  -  1  (17) 

Eq.  (13)  is  in  a  form  suitable  for  demonstrating  local  scale  invariance. 

Introduce  the  following  scale  transformations  E  ->  E’  =  Ee~'^  and 
?-»■?'=  Pe”*!’  where  ip  and  ((>  are  functions  of  V  and  T.  Also  for  the  nonrela- 

3i  3-  ^ 

tivistic  energy  density  introduce  the  scale  transformation  E  E  =  E  e  ^ 
where  is  also  a  function  of  V  and  T.  Under  these  scale  transformations 
y  and  b  assume  new  values  Y  y’  and  b  b’  ,  and  equations  (14)  through  (17) 
become 


^  3T  “  ^  ^  3V 


(18) 


«2 


V  - y'T  ~ 

3V  '  3T 


W 


=  b'  “  1 


=  -y'  -  1 


(19) 

(20) 
(21) 


where  b'  and  y’  are  to  be  determined  by  the  local  scale  invariance  conditions. 
The  local  scale  invariant  conditions  for  the  operators  in  equation  (13)  are 
written  in  a  manner  similar  to  local  gauge  invariance  as^ 


(H’  -  WpEe"’*'  *  e"'''(H^  -  W^E 
(H^  -  WpPe'***  =  e"**’(H2  -  W2)P 
(H^'  -  W^’)E^  -  W^)E^ 


(22) 

(23) 

(24) 


Equations  (22)  through  (24)  allows  the  trace  equation  (13)  to  be  written  as 
e'^(H|  -  wpe"’''E  -  3e'*’(H^  -  Wpe"‘*’p  =  e'''^(H®'  -  Wj')e“’*’V  (25) 
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so  that  the  following  operator  equations  hold 

-  Wj  =  -  wpe"'''  (26) 

H2  -  -  wpe"'*’  (27) 

The  values  of  b'  and  y'  are  obtained  as  follows.  Placing  equations  (14) 
through  (21)  into  the  scale  invariant  conditions  given  by  equations  (22)  and 
(23)  yields  the  following  results 


iKLill 


==  Y  + 


p/y  _  yT  — ) 

\  av  ^  3t/ 

P  -  T  ~  +  PT  ^ 
BT  BT 


Now  since  b'  and  y'  are  associated  E'  and  P'  respectively,  while  b  and  y 
are  associated  with  E  and  P  respectively,  one  can  write 


b'  =  b  +-^(E'  -  E)  + 


y’  =  y  +  -^(P*  -  P)  + 


while  from  E'  =  Ee  and  P'  =  Pe  it  follows  that 


E'  -  E  =  E(e  -  1) 


P'  -  P  =  P(e~'’’  -  1)  (33) 

so  that  combining  equations  (28)  through  (33)  yields  the  following  differen¬ 
tial  equations  for  \p  and  <)), 


T  ^  -  bv5i 

3T  3V 

g  +  y  _  gy 

3V  3V 
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(35) 


ix  = 

dP 


’  YT  ^  _  V  ii  \ 
^  ST  9V  \ 

P  -  T  ~  +  PT  / 

8T  3T  / 


Therefore  it  is  always  possible  to  find  a  and  4)  such  that  the  trace  equa¬ 
tion  (13)  is  locally  scale  invariant. 

Consider  now  the  infinitesimal  local  scale  transformation  where  4*  and  4i 
are  small  quantities,  then  equations  (32)  and  (33)  become 

E'  -  E  =  -4'E  _  (36) 


P'  .  p  =  _(|)p 

and  equations  (34)  and  (35)  become 


db 

dE 


T 

\|j  9T 


E  + 


av 


h  V 

4;  av 


-  EV-^ 

av 


dy 

dP 


T  _  V  ^ 
4)  3T  4>  9V 


P  -  T 


iP  +  PT 

aT  aT 


,(37) 


(38) 


(39) 


Therefore  it  is  always  possible  to  find  a  and  (j>  such  that  the  trace  equa¬ 
tion  (13)  is  invariant  under  an  inf inltesiinal  local  scale  transformation. 
Note  that  for  the  case  when  b  and  Y  slowly  varying  functions,  equations 

(38)  and  (39)  yield 


b 

. ,  c'  \'J 

'  a  V 


(40) 


Y  = 


av 


aT 


(41) 


It  turns  out  that  the  solutions  to  equations  (40)  and  (41)  are  very  simple. 
Choose  4*  and  4>  as  follows 
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PV 


4>  = 


P  V 
1  i 


(A2) 


= 


(43) 


where  PjVj^  =  an  initial  value  of  PV.  Then  from  equations  (40)  and  (41)  it 
follows  that 


b  = 


3T 


(P  -  K^) 


(44) 


Y 


jl!!b 
'  % 

‘  ■  »D  ” 


(45) 


Equation  (44)  is  just  the  basic  definition  of  the  quantity  b  given  in  equa¬ 
tion  (10),  while  equation  (45)  is  just  the  standard  result  for  the  Griineisen 
parameter  given  in  equation  (6) .  Therefore  the  values  of  4^  and  given  by 
equations  (42)  and  (43)  are  the  proper  potential  functions  for  the  infini¬ 
tesimal  local  scale  transformation  of  relativistic  thermodynamics  for  the 
case  of  slowly  varying  values  of  b  and  y*  The  explicit  determination  of 
and  (f>  for  the  general  case  of  local  scale  invariance  given  by  equations  (34) 
and  (35)  is  much  more  complicated,  and  has  not  been  accomplished* 

The  scale  invariance  of  the  trace  equations  (1)  or  (9)  requires  the 
presence  of  the  parameters  b  and  y,  although  as  mentioned  earlier,  b  and  y 
are  not  independent.  This  means  that  for  the  zero-temperature  case,  the 
zero-temperature  values  of  the  pressure  and  the  Griineisen  parameter  must  be 
determined  simultaneously  as  shown  in  equations  (2)  and  (3) .  The  establish¬ 
ment  of  local  scale  Invariance  of  the  relativistic  trace  equation  gives  one 
confidence  to  use  equations  (2)  and  (3)  to  determine  the  equations  govern¬ 
ing  the  propagation  of  small  amplitude  waves  in  a  thermodjmamic  medium.  But 
first  the  ground  state  of  the  thermodynamic  system  must  be  described. 


3.  GROUND  STATE*  The  nonrelativistic  state  equation  of  the  ground 
state  of  a  thermal  system  is  assumed  to  have  the  following  form^ 

+  E^T^  +  •••  (46) 

o  J 

=  P^  +  pV  +  •••  (47) 

o  J 
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Q  3- 

where  E  and  P  =  nonrelativistic  energy  density  and  pressure  respectively > 

Eq  and  pQ = nonrelativistic  zero-temperature  values  of  the  energy  density  and 
pressure  respectively,  Ej  and  Pj  =  nonrelativistic  thermal  coefficients  for 
the  energy  density  and  pressure  respectively,  T  -  absolute  temperature  of  the 
system  (^K) ,  and  j  =  numerical  index  having  values  characteristic  of  the  type 
of  physical  system.  Typical  examples  of  systems  that  are  described  by  equa¬ 
tions  (46)  and  (47)  are^ 

j  =  1  high  temperature  solid 
j  =  2  low  temperature  Fermi  gas 
j  =  5/2  low  temperature  molecular  Bose  gas 
j  =  4  low  temperature  solid 

A  commonly  used  descriptor  of  the  thermal  state  equations  given  by  equa¬ 
tions  (46)  and  (47)  is  the  nonrelativistic  zero-temperature  value  of  the 
Griineisen  parameter  that  is  defined  by^ 


1 

(j-1) 


—  ~  (ve!) 

i-a  dV  J 


(48) 


except  for  j  =  1  .  Here  y§  “  nonrelativistic  zero-temperature  value  of  the 
Griineisen  parameter,  and  V  =  volume  of  the  material  system.  When  j  -  1  , 

=  2/3  . 

The  corresponding  relativistic  state  equations  will  be  written  as 

E  -  E  +  E.T^  -f  •••  (49) 

o  J 

P  =  P  +  P.T^  +  •••  (50) 

o  1 


y 


o 


P. 

E. 

3 


(j-1) 


1 


(51) 


except  for  j  =  1  ,  when  Yq  -  2/3  ,  and  where  Eq  and  =  relativistic  zero- 
temperature  energy  density  and  pressure  respectively,  Ej  and  Pj  =  relativis¬ 
tic  thermal  coefficients  for  the  energy  density  and  pressure  respectively, 
and  Yq  ”  relativistic  zero-temperature  Griineisen  parameter* 


The  relativistic  values  of  the  zero- temperature  energy  density  and 
Griineisen  parameter  for  the  ground  state  thermodynamic  system  are  given  by 
the  solution  of  the  following  two  coupled  equations^ 


E 

o 


-  3[(1  +  Y  )P  -  K  ]  =  E® 
^  o  o  o  o 


(52) 
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(53) 


where  n  =  1/V  =  reciprocal  volume 

dP^ 

Kq  “  n  =  relativistic  zero-temperature  bulk  modulus 
dP® 

3  0 

Ko  =  n  ~  nonrelativistic  zero- temperature  bulk  modulus 


From  equations  (48)  through  (51) 


it  follows  that^ 


Ej  =  nCj  exp  [  -  (j-1) 

Ej  =  nCj  exp  [  -  (j-1) 

SO  that 

^  exp  [  -  (j-1)  /  (Y^  -  V  ^  ^56) 

When  Yq  “  Yo  »  must  follow  that  Ej  -  Ej  so  thAt  quite  generally  Cj  -  Ca  . 
Combining  equation  (56)  with  equation  (53)  shows  that  equations  (52)  and  T53) 
are  two  nonlinear  coupled  equations  for  determining  E^  and  Y^  in  terms  of  the 
known  values  of  E^  and  ,  These  equations  give  the  recipe  for  calculating 
the  relativistic  ground  state  of  a  thermal  system  in  terms  of  the  correspond¬ 
ing  nonrelativistic  description  of  the  ground  state.  The  calculation  of  the 
relativistic  excited  states  will  now  be  given. 


/  V„  ^  ]  (54) 

^  on 

/”  V  V  J  (55) 

'on 


4 .  EXCITATIONS ,  The  excitations  in  thermal  media  that  are  considered 
in  this  paper  are  mechanical  radiation  and  electromagnetic  waves.  Only 
waves  of  small  amplitude  are  treated.  The  thermal  state  equations  of  the 
radiation  are  assumed  to  have  a  form  similar  to  the  ground  state  equations 
(46),  (47),  (49)  and  (50)  and  are  written  as 

E^  =  E®  +  E!  T^  +  •  •  •  (57) 

r  or  jr 

=  P®  +  T^  +  •••  (58) 

r  or  jr 
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and 


E  =  E  +  E.  +  ••• 
r  or  jr 


P  =  P  +  P.  T-'  + 
r  or  jr 


(59) 

(60) 


where 


and 

or 


or 


nonrelativistic  zero- temperature  radiation  energy 
density  and  pressure  respectively 


and 

jr 


nonrelativistic  thermal  coefficients  for  the 
radiation  energy  density  and  pressure  respectively 


E 

or 


and  P 

or 


relativistic  zero-temperature  radiation  energy 
density  and  pressure  respectively 


and  P . 


relativistic  thermal  coefficients  for  the  radiation 
energy  density  and  pressure  respectively 


It  will  be  shown  in  this  paper  that  the  relativistic  state  equations  for 
radiation  given  in  equations  (59)  and  (60)  must  have  nonzero  thermal  compo¬ 
nents  even  when  the  corresponding  nonrelativistic  thermal  components  that 
appear  in  equations  (57)  and  (58)  are  taken  to  be  zero.  Finally,  the  radia¬ 
tion  terms  are  assumed  to  be  much  smaller  than  the  ground  state  terms,  i.e,, 
E^  <<  E  and  P^  <<  P  . 


When  radiation  is  present  in  a  system  whose  ground  state  is  described 
by  pQ  ♦  ^o  »  *^o  ’  these  three  parameters  become  P^  hh  p^^  ,  , 

and  +  6^^  ,  where  K  =  relativistic  zero-temperature  incompressibility 
associated  with  the  radiation,  and  6^^  -  incremental  change  in  the  relativ¬ 
istic  Grlineisen  parameter  of  the  system  due  to  the  presence  of  radiation. 
The  bulk  modulus  (incompressibility)  associated  with  the  radiation  is  given 
by 


dP. 

K  -  n  , 
or  dn 


or 


(61) 


The  increment  in  the  zero- temperature  Grlineisen  parameter  of  the  system  due 
to  the  presence  of  small  amplitude  radiation  is  obtained  from  the  defining 
equation  (51)  by  noting  that  when  radiation  is  present  this  equation  becomes 


Y  + 
'o 


6 

or 


-I-  P. 

_ 

+  E, 


(62) 
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Expanding  the  denominator  in  equation  (62) ,  keeping  only  first  order  terms, 
and  finally  subtracting  equation  (51)  gives 


6 

or 


(63) 


"  Y„) 

or  o 


where  Yor  “  zero- temperature  Griineisen  parameter  for  the  radiation  field 
itself,  and  is  defined  by 


(64) 


jr 

The  corresponding  expressions  for  the  nonrelativistic  bulk  modulus  and 
Griineisen  parameters  associated  with  the  radiation  field  in  matter  are 


dP 


n 


or 


dn 


(65) 


5^  = 
or 


(66) 


Y 


a 

or 


(67) 


From  equations  (63)  and  (66)  it  follows  that  if  Ej^.  and  E^j.  are  small  quan¬ 
tities  then  so  also  are  6^^.  and  6®^.  .  However,  the  radiation  Grunelsen  para¬ 
meters  themselves,  Yq^-  and  y^r  ♦  generally  not  small  quantities  being 
the  ratio  of  two  small  quantities,  and  at  low  pressures  have  the  value  1/3 
for  Isotropic  radiation.^  The  quantities  Eqj-,  Pqt’  Kor>  ^jr*  Pjr.  and  6or  » 
and  their  nonrelativistic  analogs,  are  taken  to  be  small  quantities-  Final¬ 
ly,  from  equation  (64)  and  the  law  of  energy  conservation  (see  Appendix  A), 
it  follows  that  the  ratio  of  thermal  terms  that  occurs  in  equation  (63)  can 
be  written  as 
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(68) 


t  D,  f  n  ,  , 

^  exp  I  -  (j-1)  /  (y^j.  -  Yq)  “ 


where  Djr  =  constant  associated  with  radiation  field. 


5.  DERIVATION  OF  WAVE  EQUATIONS.  When  radiation  is  present  in  the 
thermodynamic  systems  being  considered,  equation  (52)  becomes 


E  +  E  -  3[(1  +  Y  +6  )(P  +  P  )  -  (K  +  K  )]  =  E  +  E  (69) 

o  or  'o  or  o  or  o  or  o  or 


Similarly,  when  radiation  is  present,  equation  (53)  becomes 


1  +  j  + 


P  +  P  “  K  -  K  ^  dn  ^^o  "^orM 

o  or  o  or  -* 


(70) 


(E^  +  e!  ) 

J  jr 


1  +  j  + 


j(Y^  +  6®  ) (P®  +  P®  ) 

0  or  0  or 

P^  +  P^  - 

o  or  o  or  _J 


Equations  (69)  and  (70)  are  the  relativistic  equations  for  waves  in  matter, 
however  they  are  too  complex  for  simple  solutions  to  be  obtained.  Simplifi^ 
cation  can  be  achieved  by  assuming  the  radiation  components  to  be  small 
quantities. 

Considering  only  first  order  terms  in  equation  (69),  and  subtracting 
equation  (52) ,  gives 


E  -  3[(1  +  y  )P  -  K  ]  -  3P  6  =  E®  (71) 

or  ^  o  or  or  o  or  or 

In  a  similar  fashion,  by  expanding  the  denominators  in  equation  (70),  using 
1/(1  +  x)  '''  1  -  X  ,  and  keeping  only  first  order  terms,  and  finally  subtract¬ 
ing  equation  (53)  yields  the  following  result 
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(72) 


d6  jE.P  S 

jE.  (aK  -  3P  )  +  3E,n  ^ 

•'  j  or  or  i  dn  P  -  K 

o  o 


P  dy 

o  .  ~  'o 

„  ^  dir 

o  o  ^ 


-  jE^(a\ 
J  V  ' 


a 

or 


eV^) 


or/  _a 


jE®P®6^ 

+  E 

P^  -  K^ 
o  o 


-( 


1  +  j  + 


•  ^TlS 

p‘-  k” 

o  o 


) 


where 


a  = 


y  P 

o  o 


(P  -  K  )- 
o  o 


a  = 


a_a 
Y  P 
o  o 


(P^  -  K^) 
o  o 


(73) 


6  = 


o  o 


6“  = 


a^a 

^0^0 


(p®  -  K®)^ 
o  o 


(74) 


The  radiation  equations  (71)  and  (72)  will  now  be  written  in  a  much  simpler 
form- 


Equation  (63)  can  be  used  for  6^^.  that  occurs  in  equations  (71)  and 
(72),  while  the  first  derivative  of  6^^  that  occurs  in  equation  (72)  is  eval¬ 
uated  by  using  equations  (63)  and  (68)  and  can  be  written  as  (see  Appendix  B) 


n 


(j-l)(Y 


or 


(75) 


Substituting  the  results  of  equations  (63)  and  (75)  into  the  two  basic  equa¬ 
tions  (71)  and  (72)  yields 


E 

or 


~  3[(1  +  Y^)P 


or 


-  K  ]  -  3  P  (y 
or  Ej  o  or 


(76) 
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and 


3Ej«>Kor  -  +  'j 


.[ 


1  +  j  + 


jP  Y 
^  o  or 

P  -  K 
o  o 


+  3n 


or 


dn 


(77) 


These  two  radiation  equations  can  be  further  simplified  by  using  some  basic 
mathematical  properties  of  the  radiation  field. 

For  radiation,  the  zero-temperature  pressure  is  related  to  the  zero- 
temperature  energy  density  through  the  radiation  GrUneisen  parameter  as 
follows  (see  Appendix  A) 


P 


or 


Y  E 
or  or 


(78) 


This  proportionality  of  the  pressure  and  energy  density  is  characteristic  of 
radiation  fields.  The  bulk  modulus  of  the  radiation  field  can  be  written 
using  equation  (78)  as 


dP 


K  =  n 
or  dn 


or 


Y  j 
or  dn 


dE  dY 

or  .  r-  or 

- +  c  n  ~T~~ 

or  dn 


(79) 


Equations  analogous  to  (78)  and  (79)  hold  for  the  nonrelativistlc  quantities 
and  respectively.  Placing  equations  (78)  and  (79)  into  equation  (76) 
and  (77)  and  then  dividing  equation  (77)  by  Ej  yields  the  two  fundamental 
first  order  coupled  nonlinear  differential  equations  for  Eor  Yor  * 


dE 

In  +  JE  +  g  =  E® 

dn  or  °  or 


(80) 


dE  ^ 

Gn  -  +  SE  +  f  =  4) 

dn  or  ’^or 


(81) 
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where  1  =  3y 


(82) 


or 
dy 

J  =  3n  -  3(1  +  Y  )y  +  1 

dn  'o  or 


(83) 


g  =  3  P  (y  -  y  ) 
E.  o  ’o  *or'^ 
J 


G  =  jay^ 


or 


R 


dY 


or  „  \ 

an  -3 - gY  / 

dn  'or/ 


(8A) 

(85) 

(86) 


■if[' 


f  -  *  *•  I  1  I  1  I  ,  ''^or 

f  -  r  II  +  J  +  p  ■  -  K  +  "  ^ 

o  o 


3(j-l)(Y„^  -  Y^) 


'] 


(87) 


or 


a  p 

J 

j  _ 


1  +  j  + 


jpV  "• 

o  or 

P®  - 
o  o 


dE** 

.  or  .  „a,.a 

+  G  n  -J - +  R  E 

dn  or 


(88) 


E^ 

-  j  cV 

E,  or 
3 


(89) 


=  j 


E. 

J 


(  dY 

a  'or  „a  a 

^  ®  V 


) 


(90) 


The  ratio  Ejj./Ej  that  occurs  in  equation  (84)  and  (87)  is  a  function  of 
Yqr  shown  in  equation  (68) ,  while  the  ratio  E?/Ej  that  appears  in  equa¬ 
tions  (89)  and  (90)  is  given  by  equation  (56).  Equations  (80)  and  (81)  are 
a  set  of  coupled  relativistic  wave  equations  that  determine  the  relativistic 
energy  density  ^or  and  Gruneisen  parameter  Yor  radiation  in  terms  of  the 
corresponding  nonrelativistic  radiation  parameters  E^j.  and  Y^r  »  ®^d  in 
terms  of  the  ground  state  material  parameters  Pq  .  ,  Yq  »  »  Kq  » 


6.  MECHANICAL  WAVES  IN  A  SOLID.  The  temperature  independent  part  of 
the  nonrelativistic  energy  density  for  mechanical  waves  in  a  solid  is  given 
by^O 
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(91) 


or  4 


o  a  a 


where 


k  =  nonrelativistic  wave  number 
a 

-  nonrelativistic  wave  amplitude 

The  nonrelativistic  Griineisen  parameter  (diffuse  radiation  factor)  for  the 
radiation  itself  is  given 


,  dW 

a  _  1  _n _ a 

^or  3  W  dn 

a 


dk 
_ a 

dn 


(92) 


where  “  nonrealtivlstic  phase  velocity  of  the  waves,  and  co  -  lirf 

where  f  =  frequency  of  the  waves*  The  corresponding  relativistic  expres¬ 
sions  are  written 


E 

or 


2  2 
K  k  A 
o 


(93) 


^  M  ^  ^ _ n_  ^ 

^or  3  W  dn  3  k  dn 


(94) 


where 


k  =  relativistic  wave  number 
A  -  relativistic  wave  amplitude 
W  -  (i)/k  =  relativistic  phase  velocity 

As  before  the  relativistic  bulk  modulus  is  obtained  from  a  solution  of 
the  ground  state  material  equations  (52)  and  (53) . 

The  solution  of  the  coupled  equations  (80)  and  (81)  for  the  case  of 
waves  in  a  solid,  described  by  equations  (91)  through  (94),  determines  the 
relativistic  radiation  parameters  k  and  A  in  terms  of  the  nonrelativistic 
radiation  parameters  k^  and  A^  ^nd  in  terms  of  the  material  parameters  in 
the  following  general  forms 


k  =  k(k  ,A  ,P  ,K  ,y  ,P®,K^,Y^,w) 
a  a  o  o  o  o  o  o 


(95) 


A  «  A(k^,A^,P^.K^.Y^.P^.K^.Y^(-) 
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(96) 


The  relativistic  phase  velocity  is  W  =  w/k  ,  so  that  it  also  is  given  by  a 
function  of  the  form 


W  =  W(k  ,A  ,P  ,K  ,y 

a  a  o  o  o  o 


(97) 


The  coupled  relativistic  wave  equations  (80)  and  (81)  are  complicated 
differential  equations  that  are  difficult  to  solve  analytically,  even  for 
the  simple  case  of  a  solid.  However,  some  qualitative  results  can  be  obtain¬ 
ed  for  the  case  of  wave  propagation  in  a  solid  by  combining  equations  (80) 
and  (93)  and  assuming  that  k  and  A  (and  hence  W  )  are  not  density  dependent . 
Actually,  there  is  a  density  dependence  of  the  phase  velocity. ^  But  within 
the  crude  approximation  that  k  and  A  are  not  density  dependent,  equations 
(80)  and  (93)  yield 


dK 


?  9  /  n  \ 

kA  (in  — +  JKj+g  = 


K^k^A^ 
o  a  a 


(98) 


Then  assuming  =1/3  and  y^  =  1  in  equation  (82)  and  (83)  gives  1=1  and 

J  =  -1  ;  and  taking  n^  ,  where  a  =  adiabatic  index,  and  finally 

g  ^  »  gives  the  following  approximate  result  for  equation  (98) 


,2.2  o  a  a 
k  A  "V/ 


(a  +  1)K 


(99) 


But  the  solution  of  the  ground  state  equations  (52)  and  (53)  is  known  to 
give  Kg  at  low  pressures  and  Kq  «  Kg  at  high  pressures.^  Therefore 

within  the  limits  of  the  approximations  made  to  obtain  equation  (99),  it 

follows  that  kA  k^Aa/vo  +  1  for  very  low  pressures,  kA  k^A^for  moderate 

pressures,  and  kA  >>  k^A^  for  high  pressures-  The  values  of  a  can  range 
from  zero  to  about  two  depending  on  the  density  of  the  system. 

Finally,  an  explicit  expression  is  given  for  the  relativistic  phase 
velocity  of  mechanical  waves  in  a  thermodynamic  solid.  The  source  terms  for 
equations  (80)  and  (81)  are  Eg^  and  yg^  ,  where  Eg,,  is  given  by  equation 
(91)  while  ygj.  is  obtained  from  equation  (92)  with^ 


E®  +  P^  +  K® 


(100) 
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where  c  =  light  speed.  The  simultaneous  solution  of  (80)  and  (81)  gives 
Eor  and  Yor  •  Then  equation  (94)  is  integrated  to  obtain  the  relativistic 
sound  speed  as  follows 


_w 

c 


(101) 


Equation  (94)  shows  that  if  W  is  an  increasing  function  of  density  it  follows 
that  Yor  ^  mechanical  waves,  and  therefore .W  £  c  which  is  required 

by  special  relativity. 


7.  ELECTROMAGNETIC  WAVES  IN  MATTER.  An  analogous  calculation  can  be 
done  for  the  case  of  the  propagation  of  electromagnetic  waves  in  matter. 

For  this  case  the  radiation  energy  density  and  pressure  can  also  be  written 
in  the  form  of  equations  (57)  through  (60)  except  that  now 


hr  ■  + 

Kr--^(  *  "a’'a) 

where  E  and  H  =  relativistic  electric  and  magnetic  radiation  fields  respec¬ 
tively;  and  =  nonrelativistic  electric  and  magnetic  radiation  fields 
respectively;  e  ,  y  and  £3  *  ^a  “  relativistic  and  nonrelativistic  permit¬ 
tivities  and  permeabilities  respectively.  Therefore  the  energy  density  is  a 
function  of  many  variables.  The  electroweak  case  Includes  even  more  para¬ 
meters*  Therefore  it  is  not  possible  to  obtain  relativistic  values  of  each 
parameter  by  solving  the  two  coupled  relativistic  wave  equations  (80)  and 
(81)  unless  the  assumption  is  made  that  the  electric  and  magnetic  fields  are 
unaffected  by  equations  (80)  and  (81),  and  only  the  permittivity  and  perme¬ 
ability  need  to  be  considered.  The  solution  of  these  two  equations  gives 
only  the  relativistic  energy  density  and  Grlineisen  parameter  for  electro¬ 
magnetic  radiation*  However  an  explicit  solution  for  the  relativistic  phase 
velocity  can  be  obtained. 

The  fundamental  waves  equations  (80)  and  (81)  determine  and 

in  terms  of  and  y^^  ,  where  E^^  is  given  by  equation  (103)  and  y^j.  is 
given  by  equation  (92)  with^^ 


(102) 

(103) 


e  y 
a  a 


(104) 
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Equation  (94)  is  then  integrated  to  obtain  the  relativistic  phase  velocity 
of  electromagnetic  waves  in  a  thermodynamic  medium  as  follows 


Equation  (94)  shows  that  if  W  is  a  decreasing  function  of  density  (as  is  the 
case  for  electromagnetic  waves  xn  a  solid  or  low  temperature  quantum  system) 
it  follows  that  <1/3  and  W  ^  c  for  electromagnetic  waves  in  matter. 

Note  that  the  relativistic  phase  velocity  depends  on  the  material  parameters 
Pq  ,  Kq  ,  Yq  ,  Pj  ,  and  Yq  • 

8.  CQNCLTISIQN,  Scale  invariance  has  been  demonstrated  for  a  theory 
of  relativistic  thermodynamics  that  is  based  on  a  trace  equation.  The 
Griineisen  parameter  must  be  introduced  in  addition  to  the  pressure  to  insure 
local  scale  invariance.  For  solids  and  low  temperature  quantum  systems, 
this  means  that  the  zero^-temperature  values  of  the  pressure  and  Griineisen 
parameter  must  be  determined  simultaneously  through  the  solution  of  two  cou^ 
pled  second  order  differential  equations-  The  equations  governing  small  am¬ 
plitude  waves  In  thermodynamic  media  can  be  obtained  by  a  perturbation  cal¬ 
culation  on  these  ground  state  equations.  Two  coupled  first  order  differ¬ 
ential  equations  have  been  derived  that  describe  relativistic  wave  propa¬ 
gation  in  solids  and  low  temperature  quantum  systems.  The  solution  of  these 
wave  equations  determines  the  relativistic  energy  density  and  Griineisen  para¬ 
meter  for  radiation  in  terms  of  the  corresponding  nonrelativistic  radiation 
energy  density  and  Griineisen  parameter  and  in  terms  of  material  parameters 
of  the  ground  state.  For  mechanical  waves  the  solution  of  the  wave  equa¬ 
tions  determines  the  relativistic  amplitude  and  phase  velocity. 

Possible  observable  effects  may  occur  in  a  number  of  high  density  sys¬ 
tems.  For  instance,  unusual  dispersion  effects  and  anomalously  large  wave 
amplitudes  may  be  observed  in  the  high  pressure  states  of  solids  and  liquids 
that  occur  in  the  interior  of  planets*  stars,  and  stellar  compact 
objects. Measurable  effects  may  also  occur  in  the  vibrations  of  atomic 
nuclei  that  are  associated  with  the  giant  nuclear  resonances. Practical 
effects  may  also  be  noticed  in  the  effects  of  nuclear  explosions  and  in  the 
interaction  of  high  energy  laser  beams  with  solids. Finally,  it  should  be 
pointed  out  that  because  the  phase  velocity  and  the  wave  amplitude  must  be 
calculated  simultaneously  from  a  pair  of  coupled  nonlinear  equations,  it 
follows  that  the  phase  velocity  depends  on  the  wave  amplitude,  and  this 
characteristic  of  nonlinear  wave  propagation  may  lead  to  the  formation  of 
shocks  and  solitons  under  certain  conditions . 


APPENDIX  A  -  RADIATION  CONDITIONS.  For  radiation,  the  pressure  is 
linearly  related  to  the  energy  density  as  follows 
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p  =  r  E 
r  r  r 


(Al) 


where  =  diffuse  radiation  factor  given  by 


10 


r  1  ,  n  dw 

r  ■  3  W  dn 


(A2) 


where  the  phase  velocity  W  is  generally  density  dependent, 
tion  (Al)  with  equations  (59)  and  (60)  gives 


Combining  equa- 


P  =  r  E 
or  r  or 


(A3) 


p.  =  r  E. 

jr  r  jr 


(A4) 


But  from  the  definition  of  the  radiation  Griineisen  parameter  given  by  equa¬ 
tion  (64),  it  follows  from  equation  (A4)  that 


r  =  Y 

r  or 


(A5) 


which  means  that  the  radiation  Griineisen  parameter  is  equal  to  the  diffuse 
radiation  factor. 

If  the  pressure  and  the  internal  energy  of  the  radiation  are  written  as 


P,  =  P  +  P.  T-^ 
r  or  jr 


(A6) 


U  =  VE  =  U  +  U. 
r  r  or  jr 


(A7) 


then  the  application  of  the  Glbbs-Helmholtz  equation  of  thermodynamics 


(A8) 


yields  the  following  completely  general  equations 
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(A9) 


P 

or 


dU  , 


dU. 


j  r  dV 


dV 


(AlO) 


Combining  equations  (A3)  and  (A4)  with  equations  (A9)  and  (AlO)  respectively 
gives 


^  (VE  )  +  Y  E  =0 

dV  or  or  or 


(All) 


d^(VE.^)  -  =  0 


(A12) 


or  equivalently 


dU 


or 


U  dV 
or 


=  -  Y 


or 


(A13) 


V  dU. 

— - ^  =  (i-l)Y 

U.  dV  '■J  ^^or 
jr 


(A14) 


The  radiation  equations  (A13)  and  (A14)  can  be  immediately  integrated  to  give 


"or  ■  ’’or  ^  VV) 

"jr  ■  "jr  !  ’'or'rl 

and  finally, 

E  =  nD  exp  ( f  Y  “ )  (A17) 

or  or  \  'or  n  / 
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(A18) 


Then  combining  equation  (A18)  with  equation  (54)  gives 


^  exp  [  -  (j-1)  /  1 

j  j 


(A19) 


which  is  the  desired  result. 


APPENDIX  B  -  EVALUATION  OF  ddor/^”-  derivative  that  occurs  in  equa¬ 


tion  (72)  can  be  evaluated  from  equation  (63)  as  follows. 


n 


d6 

dS“  ■  <^or  -  ''o' 


dy 

'or  _  o 
dn  **  dn 


(Bl) 


In  order  to  evaluate  the  derivative  of  the  energy  ratio  term  in  equation  (Bl) 
one  uses  equation  (Al9)  of  Appendix  A  to  get  the  following  result 


E. 


(j-1)  (y^^  -  Y„) 


or 


(B2) 


Combining  equations  (Bl)  and  (B2)  gives  the  result 


n 


d6  E. 

-- or  ^  _jr 

dn  E. 


dy 


or 


dn 


-  n 


dn 


-  (j-l)(Y 


or 


-  ’•>■] 


(B3) 


which  is  the  desired  result. 
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ABSTRACT.  The  equation  ut  —  models  the  expansion  of  a  gas  with  density 

u(x,<)  in  a  porous  medium.  We  give  an  equivalent  formulation  of  this  equation  as  a  free 
boundary  problem,  the  free  surface  being  the  boundary  of  the  set  where  the  gas  density 
is  nonzero.  Using  this  formulation,  we  discuss  new  a  priori  estimates  for  the  pressure 
r  [m/{m  -  In  one  dimension,  our  estimates  imply  that  the  pressure  and  the 

free  boundary  are  infinitely  differentiable. 


1.  THE  FREE  BOUNDARY  PROBLEM.  The  initial  value  problem 

Ut(x.0  =  Au'^(x.O,  X  c  R^  /  <  T, 
u{x,0)  =  uo(x) 


describes  the  expansion  of  a  gas  in  a  porous  medium:  u  denotes  the  gas  density,  A  = 
V  •  V  =  ^  dj  and  m  >  1  is  a  physical  parameter. 

In  this  section  we  derive  an  equivalent  formulation  of  the  initial  value  problem  (1). 
We  assume  that  uq  has  bounded  support  0(0)  with  smooth  boundary  3fl(0).  Then, 
n(/)  svpp^i  J)  is  bounded  for  all  /.  We  rewrite  (1)  as  an  initial  value  problem  for  the 
pressure  v  (m,  (m  -  l))?/”*"  U  By  direct  substitution, 

vt(x,f)  =  (m  -  l)i:(x,<)At’(x,t)  -r  Vi;(x,/)  *,  x  t  Q{t),  t  <  T, 
i;(x,0)  =  Vo(x). 

We  assume  that  v  is  twice  continuously  differentiable  and  define  a  family  of  curves  y  —>■ 
y  €  n(0).  via  the  system 

f(y.O)  -  y. 

Note  that 

j^v{^(yA)J)  =  (m  -  l)vU(yJ).t)Av(^{y,t)J) 
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which  implies  in  particular  that  v(^{y,t).t)  ~  0  for  y  €  dfi(O).  Therefore,  if  t  <  T  with  T 
sufficiently  small,  defines  a  1-1  mapping  of  0(0)  onto  0(t). 

Summarizing  the  above  considerations  we  have: 

Ifu  is  a  weak  solution  of  (l)  for  which  v  is  twice  continuously  differentiable  on  its  (closed) 
support,  then  v  is  a  classical  solution  of  (2)  and.  for  T  sufficiently  small,  n(t)  =  ^(0(0),t) 
with  ^  defined  by  (3). 

The  converse  of  this  statement  also  holds: 

If  the  pair  (v,  is  a  classical  solution  of  (2,3)  with  fl(t)  =  ^(0(0),  <)  and  if  the  mapping 
t)  is  1-1  for  all  t  <  T,  then  u  is  a  weak  solution  of  (l). 

Here  we  have  used  the  notation  /(/)  :=  {/(A)  :  A  €  /}  for  a  function  /  and  a  set  1. 

To  verify  the  last  assertion,  let  0  be  any  test  function  with  compact  support  in  Q 
X  (0, T).  Integrating  by  parts. 


where  we  have  used  that  v  vanishes  on  the  boundary  of  Q(t)  and  therefore  the  boundary 
normal  is  parallel  to  —Vv. 

We  need  the  following 

Lemma.  Assume  that  t)  as  defined  in  (3)  is  a  1  -  1  mapping  of  n(0)  onto  n(t).  Then, 
for  any  smooth  function  /, 


Using  this  and  the  fact  that  ^(x,  •)  has  compact  support  in  (0,  T),  the  right  hand  side 
of  (4)  equals 


-  /  /  Ut(t>  ~  (  I  iVU;UP 

Jo  Ju{t)  Jo  j 

7 

4.  f{Au^^)6-  f  f 
Jo  Jo  Jenu) 


This  expression  vanishes  since  equations  (l)  and  (2)  are  equivalent  on  fl(t)  and  since 
Vu’'"  =  uVv. 

For  the  sake  of  completeness  we  include  a 

Proof  of  the  Lemma.  Denote  by  the  Jacobi  matrix  {dk^j{--.t)}  where  ij  is  the 

j-th  component  of  the  vector  From  (3)  it  follows  that  the  determinant  of  V f  satisfies 
the  differential  equation 

(dctVf)t  -An(detV0- 


Note  in  particular  that  detV^{-,i)  is  positive  for  all  t.  Using  this,  we  obtain 


f  dx  ^  f  ftU{y,t)J)(det^O{y^i) 

jQ(t)  Jn{0) 

=  dt  f  fU{y-t),t){detV^)(yJ)  dy 

Jfi(O) 

-  f  iVf)(^(y.t),t)  ■  ei(yJ}(detV^)iy,t}  dy 

Jq(0) 

*  I  /(^(y’i)^t)Av(^(yj),t)(detV^)(y,t)  dy 

Jq{0) 

=  dt  [  f[xj)dx  f  V f{x,t)  -Vvix^t)  +  f{xJ)Av{x,t)  dx. 
JQit)  Jo{t) 


(6) 


Equation  (5)  follows  by  replacing  the  last  term  in  (6)  by  a  boundary  integral  and  recalling 
that  the  boundary  normal  equals  -\’v/[Vv\. 

One  notices  the  similarity  of  the  free  boundary  problem  (2,3)  to  the  Stefan  problem. 
E.g.  the  one  phase  Stefan  problem  is  described  by  equation  (3)  and  the  heat  equation 


vt{x.t)  =  Av{x,t),  x  €  n(<),  t  <  T, 
v(x,0)  =  uo(i). 


In  this  case,  t*  denotes  the  temperature  distribution  of  water  in  a  region  fl(t)  which  is 
surrounded  by  ice. 

Existence  of  weak  solutions  for  the  porous  medium  equation  and  Stefan  problems 
can  be  proved  via  semi  group  theory  3'.  Regularity  results,  in  particular  for  the  free 
boundaries,  appear  to  be  considerably  more  difficult  to  obtain.  For  the  one  phase  Stefan 
problem  the  existence  and  C^-regularity  of  classical  solutions  was  obtained  by  Hanzawa 
■7j  and  Kinderlehrer  and  Nirenberg  illi  using  rather  sophisticated  techniques.  For  the 
porous  medium  equation  in  several  variables  no  comparable  regularity  results  are  known. 
A  major  difficulty  in  studying  existence  and  regularity  for  the  system  (2,3)  is  that  the 
parabolic  operator  degenerates  on  the  free  surface. 

For  the  one  dimensional  case  (d  =  1)  Caffarelli  and  Friedman  [4\  have  shown  the 
existence  of  classical  solutions  for  the  system  (2,3).  Using  their  result  we  proved  in  jOj  that 
the  pressure  i’  and  the  free  boundary  are  infinitely  differentiable  as  soon  as  the  support 
of  u(-,<)  is  expanding.  This  result  is  discussed  in  section  2.  In  section  3,  we  describe  new 
a  priori  estimates  for  the  multivariate  case.  If  a  suitable  regularization  or  approximation 
procedure  can  be  found,  these  estimates  imply  the  existence  of  smooth  solutions  for  the 
system  (2,3). 
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2.  C^-REGULARITY  IN  ONE  DIMENSION.  In  one  variable  we  may  assume  that 
n(0)  =  j  — l.li.  Then,  the  system  (2,3)  reduces  to 

e(=i.o)-r] 

Vi  =  (W  '  V;.  ^(-1.0  <  X  <  ^(1./)-  t  >  0. 

i)(a:.0)  =  t’C" 

The  curves  £(±1,-)  are  Lipschitz  continuous  T2  .  but  in  general  not  C^.  As  was  observed 
in  ilj,  need  not  be  continuous  at 

;=  sup{/  ;  =  y}.  y  =  ±1. 

Caffarelli  and  Friedman  4  proved  that  the  system  (7,8)  has  a  clas.sical  solution  for  t  > 
max{t-.  j.  ii}\  i.e.  the  functions  r,  Vt.  Vx  and  I'l'xx  are  continuous  up  to  the  free  boundaries. 
Using  this  result  we  proved  in  9]  the  following  optimal  regularity  result, 

Theorem.  Assume  that  r,,  is  continuous.  Then,  ^{y,  •)  ^  C^(ty.  oc),  y  ~  ±1.  and 
V  €  0*^(0.)  where  0,  {(■*^-  0  ■  v(x,  t)  >  0}  J  {(s(y'  y  —  I  }• 

Our  proof  of  this  result  is  based  on  a  priori  estimates  in  weighted  norms.  To  illustrate 
the  main  idea,  we  make  a  simplifying  assumption.  We  suppose  that  t-y  =  t]  =0  which 
implies  in  particular  that 


Vx{^{y,t)J)  0,  y  ^  ±1,  t  >  0. 

Under  this  strengthened  hypothesis  we  have  the  following  a  priori  estimate. 
Proposition.  For  any  smooth  solution  (r.  £)  of  (7,8)  and  any  A;  >  0.  ^  >  0, 

IT 

max  f  Vid'tv^  dx  -r  (  f  dx  dt  <  Ak  (9) 

^  JS  Ju{i) 

where  the  constant  Ak  depends  on  k,  8.  T  and  Vq  hut  not  on  v. 

An  analogous  estimate  is  also  valid  for  a  suitably  defined  Galerkin  approximation. 
This  yields  the  regularity  of  v  and,  by  (7).  also  the  regularity  of 

W’e  prove  (9)  by  induction  on  k.  Thus  assume  that  the  estimate  (9)  holds  for  k  <  1. 
Differentiating  (8)  and  setting  w  :=  d^v  we  obtain 

Wf  —  [m  -  l)viVxx  ^  ;2  -  /(m  ■  1)  i-rtf/'x  -t-  /  (10) 

where  /  is  a  sum  of  terms  of  the  form 

d'^v  d^v\  u  ~  y  ~  I  ^  2.  u  <  y  <  I. 
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We  multiply  (10)  by  r'^vw  with  t  \=  1  -  6  and  integrate  each  term  over  the  interval 
n(t)  =  f(l,0  ■  Integrating  by  parts  and  observing  that  the  boundary  terms 

vanish  we  obtain 


^d.j  T'vu-  -  h  -  ((m-]);2)72  -  (1  2)  /a 


{m-  1)  I 

-  [tn  -  1)  I  T‘v~u-l  -  2(ni  -  1)  ^  t~vViWWi 
2  —  (/  -  2)(m  -■  1)  /  r^vvjvu'j  = 

Ju(t) 

-  \1  ^  (/  -  2)(m  -  l)/2:  {]■>  -  lo). 


Combining  these  identities  yields 


2  Jnit) 


vw~  ^ 


n(t) 

~  h  —  12^2 


f 

(m  -1)  /  T^v"w 
Ju(t) 

03^3  —  ^  7*'vwf 


(11) 


where  the  constant  ')-j  is  positive  for  /  >  1. 


The  estimation  of  the  terms  on  the  right  hand  side  of  (ll)  is  somewhat  technical,  in 
particular  if  I  is  small.  Let  us  assume  that  /  >  4  and  refer  to  :9'  for  the  remaining  cases. 
We  need  two  auxiliary  inecjualities.  By  the  result  in  :4:.  v  is  continously  differentiable  w'ith 
modulus  of  continuity  depending  only  on  V(_,.  Lsing  this,  and  the  fact  that  t-x  does  not 
v'anish  on  either  of  the  free  boundaries,  it  is  not  difficult  to  show  (cf.  Lemmas  1,  2  in  |9  ) 
that 


Q  •  oc* ,  n  ( / ) 


(12) 


for  any  smooth  function  g.  uniformly  for  /  <  T.  Moreover,  there  exists  a  positive  constant 
cs  which  depends  only  on  t’o  and  T  such  that 


(13) 
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Using  Holder's  inequality  and  (13)  yields 


From  (12)  it  follows  that  ii’i3-(  r0  oc,n(f)  ^^.n  be  bounded  in  terms  of  A4  for  t  >  b. 
Therefore,  if  K  denotes  a  generic  constant  which  may  depend  on  and  vq.  we  have 

h-  <  K  I  (15) 

For  the  last  term  in  (11)  we  have  to  estimate  the  integrals  7^^  :=  /  T-v{d^v){d^v)u:. 
Since  ly  <  {I  2)/2  <1-3,  (12)  implies  that  |c>^'t'(-.<)ioc.rt(t)  ^  ^  ^  Hence  we 

obtain 

iWi  ^  j  T‘vw‘).  (16) 

We  choose  f  C3  less  than  '73  so  that  the  first  term  on  the  right  hand  side  of  (14) 
is  less  than  '7373.  Substituting  the  estimates  (14-16)  into  (11)  we  obtain  the  differential 
inequality 

-dt  I  t"vw~  H-  (m  -  1)  / 

<  7f(l  H-  J  T^vw^)  —  (t“*  +  cT'/cs)  J  v'^tv^. 

Integrating  this  inequality  over  the  interval  ''S.T\  and  using  that 

[  j.V<  A, 

yields  (9)  for  fc  —  /  with  6  replaced  by  2b. 

3.  A  PRIORI  ESTIMATES  IN  SEVERAL  VARIABLES.  Based  on  the  results  in 
the  univariate  case  one  is  tempted  to  conjecture  that  the  free  surface  ^Q(t)  is  infinitely 
differentiable  in  the  neigborhood  of  any  point  where  the  support  of  i’(-r<)  is  expanding.  A 
first  step  in  this  direction  is  again  a  result  by  Caffarelli  and  Friedman  'Si  who  showed  that 
the  free  surface  is  Holder  continuous.  More  recently.  Gurtin,  McCamy  and  Socolovsky 
i6i  discovered  an  interesting  transformation  which  yields  a  system  of  partial  differential 
equations  for  the  free  surface  which  does  not  involve  r.  However,  this  system  does  not 
fall  in  any  of  the  standard  categories  and,  except  for  the  univariate  case  :8:  existence  of 
smooth  solutions  could  not  been  established.  We  formulate  below  a  priori  estimates  which 
would  imply  the  existence  of  classical  solutions  for  the  system  (2,3)  if  an  appropriate 
approximation  procedure  can  be  found. 
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Theorem.  Assume  that  dn(0)  is  smooth.  Vi:o(i)  ^  0  for  i  €  dn(0)  and  {v.^)  is  a 
smooth  solution  of  the  system  (2,3).  Let  ]w'I  q  Tl[ai=k  denote  the  semi 

norm  of  Then,  for  any  k  >  2  ^  d/2,  there  exist  constants  T  and  B  which  depend 

on  fc,  n(0)  and  vo  but  not  on  v  and  ^  such  that 

max  |t-(-,0:fc.n(f!  £  -6. 


The  proof  of  the  Theorem  is  given  in  a  forthcoming  paper  lOL  As  in  the  univariate 
case,  our  arguments  are  based  on  energy  estimates.  The  major  difficulty  is  the  estimation 
of  the  tangential  derivatives  of  v  in  a  iieigborhood  of  50  (t). 


i: 


'2 


;3 

■A] 

i5] 

;6j 

A] 

is: 

i9: 

lio; 

ill! 

;i2 
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ON  THE  TREATMENT  OF  POISSON'S  EQUATION  BY  PIECEWISE 
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ABSTRACT 

An  efficient  numerical  method,  used  previously  for  a  ordinary  differential 
equation  is  here  extended  to  a  partial  differential  equation  (in  particular, 
Poisson's  equation)  of  a  function  of  two  variables.  Bicubic  functions  are  used 
as  the  basic  approximations.  Residuals  are  liquidated  by  setting  their  integrals 
equal  to  zero  over  specified  subregions  of  analyticity. 

INTRODUCTION 

The  approximate  solution  of  ordinary  differential  equations  with  the  use  of 
piecewise  polynomials,  or  spline  curves,  and  the  partition  method  was  investi¬ 
gated  by  Langhaar  and  Chu  (1).  From  the  results  of  their  application  of  piece- 
wise  cubic  and  piecewise  quintic  approximations  to  a  number  of  ordinary  differen¬ 
tial  equations,  they  have  concluded  that  this  method  has  certain  advantages  over 
other  methods.  For  example,  this  method  yields  directly  the  first  derivatives  in 
the  case  of  a  piecewise  cubic,  or  the  first  and  second  derivatives  in  the  case  of 
a  piecewise  quintic  approximation  in  addition  to  the  value  of  the  function  at  a 
finite  number  of  points.  The  finite-difference  method  of  solution  yields 
directly  only  the  values  of  the  function,  an  obvious  drawback,  since  for  many 
problems  in  mechanics,  the  derivatives  are  the  sought  for  unknowns.  Also,  the 
finite-element  method  is  based  on  variational  principles  and  the  physical  problem 
at  hand.  The  piecewise  polynomial  method  is  free  of  such  formulations,  and  is, 
in  that  sense,  more  general. 

In  this  Investigation,  the  piecewise  polynomial  method  is  extended  to 
partial  differential  equations  (in  particular,  Poisson's  equation)  of  a  function 
of  two  variables. 

For  an  approximating  polynomial,  a  bicubic  was  chosen,  i.e., 

3  3  i  j  (1) 

<|)(x,y,)  “I  E  tti^x  y 

i=0  j=0  ^ 
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CONSTRUCTION  OF  THE  BICUBIC  POLYNOMIALS 

A  finite  closed  region  of  the  x-y  plane  is  divided  into  MN  subregions  (M 
being  the  number  of  subregions  in  the  x-direction  and  N  being  the  number  of  sub- 
regions  in  the  y-direction)  by  the  lines  x  =  and  y  =  yj  such  that 

0®=x  <x,  <x_...<x  =  L 
o  1  2  m 

0=y  <y,  <y„»**<y  =  W 
o  •^l  n 


L 

W 

0  ^i^i+1  ^ 


Figure  1 


Equation  1  is  written  in  the  following  explicit  form 

<|>(x,y)  “81+  ^2^ 

+  a^y  +  ^gxy  +  a^x2y2  +  agX^y2 
+  a9y2  +  a^gXyZ  +  ajjx2y2  + 

+  ai3y3  +  aj^xy3  +  ai5x2y3  +  ajgx3y3 

In  terms  of  values  0 ,  (()  »  <|>  and  ^  at  the  four  corners  of  the  subregion, 
one  has  x  y  xy 
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•  •  •  • 


+  3  137^  +  a^xy^ 


+  3i5x2y3  +  aigx3y3  »  4,^ 


(x.  ,y.)  =  3,  +  2aaX.  +  3a,, x?  + 


Y  V-*,  y-'  j  " 

X  i  j 


..  +  ai„y3  +  2ai5X.y3  +  3ajgx2y3 


2  +  aoX?  +  . 


*  (x^.y  )  =  35  +  agx.  +  a^xj  +  agx^ 


,.  +  3a„y2  +  3a,.x  v2  +  3a  ,5x2y2 


‘I3^j  ^ 


1  J 


+  =  ♦a 


f^yCx.  .y.)  =  ag  +  2a7X.  +  3a8x2  +  Za^Qy^  +  ...  +  fea^gX^yS  +  9a^gx2y2  =  4,^ 


"  ^2^  *••  ®i4v; 


2  4.  +  p  V  v3  +  a„x2y3  +  a^^x^yS 


j  +  1  i  j+1  i  j  +  1 


♦  (x^^l-Yj)  -  a.  .a,Vl  ■  -3  >1 


•  *  “l-Vll  * 


*  “16*1+,'',  ■  *9 


*  “3V1  *  “si.l  *  -  *  “l-Vl'-J.!  *  ^3*?*,=',’ 


i+1  3  1+1 


j  +  1 


j  +  1 


+  a,,x3  y3  =41 
16  i+lj+l 


13 


xy  i+lj+l 


■  ♦le 
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These  equations  when  written  in  the  form  of  matrices,  are 


1 

X 

x2 

.  .  .  x2y3 

X3y3 

i 

i 

i  j 

i  3 

0 

1 

2x 

...  2x  y3 

3x2  y  3 

i 

i  j 

i  3 

0 

0 

0 

•  •  •  3x2y2 

3x3  y2 

m 

m 

• 

• 

i  3 

• 

• 

« 

0 

• 

"o 

0 

•  •  • 

* 

^*i+l5^j+l 

0 

0 

0 

•  •  • 

9x2  v2 
i+Fj+1 

<•  > 

f  \ 

♦  1 

♦2 

^3 

♦3 

i 

• 

• 

r  =  1 

• 

• 

a  15 

♦  is 

a  16 

♦  16 

i  t 

or  just 

IE]  [A]  =  [®] 

The  bicubic  polynomial,  equation  1,  when  written  in  the 
♦(x.y)  =  [I,x,x2,x3,  ....  x2y3,x3y3]  [A] 

or  alternatively 


(2) 

form  of  matrices  is 
(3) 


(|)(x,y)  =  tl,x,x2,x^,  ...  x2y3,x3y3]  [C]  [$] 


(4) 


where  [C]  is  as  yet  some  undetermined  16  x  16  matrix  of  constants. 

Comparison  of  equations  3  and  4  shows  that 

[A]  =  [c]  m 

but,  from  equation  2 
[A]  =  [E]‘‘  [$] 

Therefore,  [C]  =  [E]"* 

and  a  bicubic  polynomial  whose  coefficients  are  expressed  in  the  generalized 
deflection  can  now  be  written  for  each  subregion. 
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POISSON’S  EQUATION 


Poisson's  equation  is 

L  ((t>)  =  f(x,y)  (5) 

where 


8x2  5y2 

If  is  substituted  for  4*  in  equation  5,  then 

L(D  =  [L(l),  L(x),  L(x2),  L(x2y3),  L(x3y3 ) ] [C] [S]  =f(x,y)  +  e(x,y) 

Ay 

where  e(x,y)is  the  error  resulting  from  the  approximation  of  4  by  4)* 

Each  subregion  is '  divided  into  four  parts  and  the  integral  of  the  error 
e(x,y)  over  each  of  these  parts  is  set  equal  to  zero  (Partition  Method).  This 
procedure  yields  a  set  of  4MN  equations  involving  the  4(M+1)(N+1)  unknown  4*^, 
i .  e . ) 

//  L(J*)dxdy  =  //  [L(l),  L(x),  L(x2),  .  .  .  L(x2y3),  L(x3y3)]dxdy [C]  [$] 
each  part  each  part 

=  //  f(39r)dxdy 
each  part 


The  remaining  4(M+N+1)  equations  needed  are  obtained  from  the  boundary  con¬ 
ditions. 

EXAMPLE 

Consider  the  torsion  of  a  bar  of  rectangular  cross-section,  Fig,  2.  This 
problem  reduces  to  the  solution  of 

v2  4)  =  1  in  R 
4»  =  0  on  S 


where  the  shear  stresses  are  given  by 
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-'xz  =  "2G0|^ 
•'yz  “  “2^ 


Let  L=W=1  and  M=N=2,  then  the  unit  square  region  R  is  divided  into  four 
equal  subregions  each  of  which  is  divided  into  four  equal  parts.  Setting  the 
integral  of  the  error  over  each  of  these  parts  equal  to  zero  yields  sixteen  equa¬ 
tions. 


Figure  3. 


The  numbering  scheme  is  shown  in  Figure  3.  At  the  lower  left  node, 

01  =  (^,  (|)2  “  <l>3  “  ♦y*  <>4  “  'J’xy* 

Imposing  the  boundary  condition  0  on  S  yields 

01  =  05  =  09  =  013  "  021  =  025  =  029  =  033  =  0 
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Also,  since  ^  **  constant  along  S 

^2  “4^10  “  ^14  "  4^22  ^  4^26  '  ~  ^ 

and 

♦  3  ”  ♦?  ”  4^11  ®  4>27  “  ^31  *  4>35  ® 

The  above  20  equations,  together  with  the  16  equations  obtained  from  setting  the 
integral  of  the  error  equal  to  zero,  constitute  a  set  of  36  linear  algebraic 
equations  from  which  the  36  deflections  can  be  obtained.  Knowing  the  the 
problem  is  completely  solved. 

Alternatively,  the  integral  of  the  error  along  the  boundary  can  be  set  equal 
to  zero,  along  16  equal  distinct  segments  of  the  boundary  S  together  with  the  4 
equations 


— 

♦9  “  ^25  “ 

♦  33  " 

^  0 

Nonzero  values  of  4),  (f 

♦y*  4> 

<>xy 

= 

2.0357 

at 

(0,  0) 

0.3304 

at 

(0,  V2  ) 

'I’xy 

= 

-2.0357 

at 

(0,  1) 

= 

0.3304 

at 

( V2 .  0) 

<|) 

= 

0.0737 

at 

(  V2  .  V2 ) 

= 

-0.3304 

at 

( V2.  1) 

♦xy 

- 

-2.0357 

at 

(1,  0) 

♦x 

= 

-0.3304 

at 

(1.  V2) 

^xy 

= 

2.0357 

at 

(1.  1) 

The 

results  are 

shown 

in  Table  1 

in  Figure  2. 

The 

value  for  ^ 

xy 


R,  i.e.”  4)  •  These  values  are  given  for  M=5N=2  and  M=sN=4,  along  with  the  exact 
values.  Tfl^  values  of  found  by  satisfying  the  boundary  conditions  polntwise 
were  the  same  as  those  found  by  setting  the  integral  of  the  error  along  the  boun¬ 
dary  equal  to  zero. 
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M  =  N 


4>6  =  <t>x(0>l/2  ) 


4)  17  =  <t>  (  V2  ,  V2  ) 


2  0.33036  0.073661 

4  0.33743  0.073656 

exact  0.3375  0.073671 

Table  1. 
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ADAPTIVE.  SELF-VALIDATING  NUMERICAL  QUADRATURE 


George  F.  Corliss 
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Marquette  i University 

L.  B.  Rail 

.Mathematics  Research  Center 
University  of  Wisconsin- Madison 

Abstract.  Integrals  of  a  function  of  a  single  variable  can  be  expressed  as  the  sum 
of  a  numerical  quadrature  rule  and  a  remainder  term.  The  quadrature  rule  is  a  linear 
combination  of  function  values  and  weights,  or  the  integral  of  a  Taylor  pol3'nomial,  while 
the  remainder  term  depends  on  some  derivative  of  the  integrand  evaluated  at  an  unknown 
point  in  the  interval  of  integration.  Numerical  quadrature  is  made  self-validating  by  using 
interval  computation  to  capture  both  the  roundoff  and  truncation  errors  made  when  using 
a  given  rule.  Necessary  derivatives  can  be  generated  automatically  by  using  well-known 
recurrence  relations  for  Taylor  coefficients.  In  order  for  quadrature  methods  of  this  type  to 
be  accurate  (in  the  sense  that  small  intervals  are  produced)  and  efficient  (to  obtain  results 
of  given  accuracy  in  a  reasonably  short  time),  an  accurate  scalar  product  and  an  adaptive 
strategy  are  required.  The  necessary  scalar  product  and  support  for  interval  arithmetic  are 
provided  in  Pascal-SC  (for  microcomputers)  and  ACRITH  (for  IBM  370  computers).  The 
adaptive  strategy  chooses  the  subintervals  of  integration  and  the  order  of  the  quadrature 
formula  in  each  subinterval  on  the  basis  of  guaranteed,  rather  than  estimated,  information 
about  the  error  of  numerical  integration  in  each  subinterval.  The  program  described  in 
this  report  implements  standard  Newlon-Cotes,  Gaussian,  and  Taylor  series  methods  for 
numerical  integration.  Waj’s  to  handle  singularities  are  discussed,  and  comparisons  are 
given  with  a  standard  numerical  integration  method. 

1.  Requirements  for  Automatic  Integration  Algorithms.  In  l2j,  de  Boor  formulates 
fundamental  requirements  for  an  automatic  algorithm  for  numerical  approximation  of  the 
integral 

(i-i)  =  i 

J  a 

of  a  function  of  a  single  real  variable.  Such  an  algorithm  requires  (i)  the  limits  of  integration 
a,  6,  (ii)  access  to  a  procedure  for  the  evaluation  of  /(x)  for  x  in  the  interval  of  integration, 
(iii)  tolerances  a,p  on  the  desired  absolute  and  relative  error,  respectively,  and  (iv)  a  limit 
M  on  the  number  of  function  evaluations  allowed. 

As  output,  the  algorithm  should  produce  an  estimate  7"  for  the  value  of  If  which 
satisfies 

(1.2)  ,1/  ~  J’  <  max{a.p  If 

Sponsored  in  part  by  the  U.  S.  Army  under  Contract  No.  DAAG29-80-C-0041. 
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Furthermore,  the  algorithm  should  be  efficient,  computing  as  few  function  values  as  pos¬ 
sible.  It  should  also  be  reliable,  which  will  be  taken  here  to  mean  that  either  the  desired 
accuracy  (1.2)  is  guaranteed,  or  a  message  to  the  contrary  is  returned  to  the  user,  possibly 
with  additional  information  about  the  cause  of  failure.  As  pointed  out  by  de  Boor  [2], 
algorithms  which  use  only  values  of  /(x)  at  a  finite  number  of  points  cannot  meet  the 
above  requirements  in  general;  nevertheless,  accurate  and  efficient  automatic  integration 
algorithms  can  be  formulated  for  wide  classes  of  integrands  !3],  |23j. 

This  paper  presents  automatic  quadrature  algorithms  which  attain  the  goals  of  reli¬ 
ability  and  efficiency  by  use  of  automatic  differentiation  and  interval  computation.  They 
make  use  of  information  about  the  integrand  on  entire  subintervals  of  integration,  rather 
than  at  a  discrete  set  of  points.  The  results  combine  the  self-validating  algorithms  of  Gray 
and  Rail  [Sj,  [9],  llO],  and  the  notion  of  adaptive  quadrature  :2],  [3j,  [23].  Adaptation  is 
carried  out  on  the  basis  of  guaranteed,  rather  than  estimated,  bounds  for  the  error  of  the 
approximate  integration  over  each  subinterval.  Furthermore,  the  given  algorithm  has  the 
ability  to  detect  and  handle  certain  types  of  singularities  in  the  integrand,  and  even  to 
verify  nonexistence  of  the  integral  in  some  cases. 

In  the  terminology  of  Rice  [23],  the  method  described  here  has  the  following  features: 

Interval  Processor  Component: 

Variable  order  rule  with  remainder  using  interval 

arithmetic  to  give  guaranteed  bounds. 

Bound  Estimator  Component: 

Direct  analysis. 

Special  Behavior  Components: 

Polynomials. 

Roundoff  level. 

Singularities  in  derivatives. 

Jump  discontinuities. 

Removable  singularities. 

x'^-type  singularities. 

All  are  strictly  validated. 

Interval  Collection  Management  Component: 

Ordered  list. 

None  discarded. 

The  method  of  this  paper  does  not  belong  to  the  large  family  of  10^  or  so  algorithms 
considered  by  Rice  because  of  the  use  of  interval  computation  and  automatic  differentiation, 
which  were  not  considered  in  [23^  Details  of  the  actual  implementation  of  the  algorithms 
presented  here  in  an  environment  w'hich  supports  interval  computation  will  be  given  in  §7. 
The  next  few  sections  describe  the  underlying  methodology. 

2.  Self-validating  Evaluation  of  Quadrature  Formulas.  Self-\'alidation  of  numerical 
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computations  is  one  of  the  basic  motivations  of  interval  analysis  jli,  115!,  Il6].  The  goal 
is  to  obtain  an  interval  which  contains  the  desired  result,  be  it  real  or  set-valued.  In  the 
case  of  the  integration  problem  (1.1),  a  self-vali dating  interval  method  produces  an  interval 
J  =  [c,rf|  which  is  guaranteed  to  contain  the  value  //of  the  integral.  The  width  of  this 
interval  inclusion  will  depend  on  uncertainties  in  the  values  of  the  integrand  and  the  limits 
of  integration,  the  roundoff  error  in  the  actual  computation,  and  the  truncation  error 
appropriate  to  the  method  used.  All  of  these  quantities  can  be  estimated  in  a  tedious 
way  by  the  techniques  of  classical  error  analysis,  an  effort  which  is  unnecessary  in  the 
computational  environment  described  below.  However,  once  an  interval  |c,dj  containing  1 
is  found  by  whatever  method,  one  has  the  following  approximations  to  1  and  corresponding 
error  bounds  l21j: 


(2.1) 


/■  = 


U~r\<\(d-c), 


for  absolute  error,  or 


(2.2) 


r  = 


2cd 


d' 


If  ~  r  i 
iT 


I  < 


d  —  c\ 

TTd'' 


for  relative  error,  with  cd  >  0  in  this  case.  It  follows  that  (1.2)  will  be  satisfied  if  an  interval 
J  ~  jc,d;  can  be  obtained  with  width  w(J)  =  d  -  c  small  enough  so  that  w(J)  <  2a  and 
«’(J)  <  p  c  +  d,. 

First,  the  problem  of  finding  an  interval  inclusion  J  of  If  will  be  considered.  The 
basic  method  for  interval  integration  by  use  of  standard  formulas  for  numerical  quadrature 
or  Taylor  series  was  first  described  by  Moore  |14-.  To  illustrate  Moore’s  idea,  consider  a 
standard  interpolatory  integration  formula  of  the  form 

(2.3)  I  f{x)dx  =  ^ti;./(x,)  ^c„h- 

where  h  =  {b  —  a)/n.  and  a  <  ^  <  b.  A  formula  of  type  (2.3)  will  be  called  a  quadrature 
formula  of  order  p  on  n  points.  The  ordinary  Gauss  and  Newton-Cotes  integration  formulas 
follow  this  pattern  [7]. 

It  should  be  noted  that  integration  formulas  such  as  (2.3)  give  the  exact  value  of  the 
integral  If  of  functions  which  are  differentiable  p  times.  The  only  difficulty  is  that  the 
value  of  (f  is  unknown.  For  practical  computation,  it  is  thus  customary  to  express  (2.3)  as 
the  sum  of  a  rule 


(2.4)  r^f  =  Y^wJix,) 

1=) 

of  numerical  integration,  and  a  (truncation)  error  term 

(2.5)  enf  =  c„h  ■  f,,{^-h). 
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where 


(2.6) 


p' 


denotes  the  Taylor  coefficient  of  order  p  in  the  expansion  of  /(^  -r  h).  It  is  usual  to 
compute  J*  —  Tnf  to  approximate  the  value  of  the  integral,  and  to  estimate  e„/  somehow. 
Of  course,  if  /  is  a  polynomial  of  degree  p  ~  1  or  less,  then  Cnf  =  0,  and  I f  =  r^f. 

A  self-vali dating  computation  of  the  rule  r„/  of  numerical  integration  is  straightfor¬ 
ward  in  an  environment  which  provides  interval  arithmetic  and  monotone  interval  inclu¬ 
sions  of  the  library  functions  used  in  the  evaluation  of  f{x)  for  a  given  x.  Let  S  denote  the 
screen  of  floating-point  numbers  available,  and  IS  the  corresponding  set  of  closed  intervals 
iu,v;,  u,v  €  S.  If  /  is  evaluated  on  an  interval  X  t  IS  using  interval  arithmetic  and 
library  functions,  then  the  result  is  the  natural  interval  inclusion  F{X)  of  /  on  X  such 
that 


(2.7)  f{X)  =  {fix)  \  X  €  X}  C  F{X), 

:15j,  [l6’l.  If  Xi  respectively  denote  the  smallest  intervals  in  IS  which  contain  the  real 
numbers  u;,,  x,,  that  is,  Au;,;,  jVxi,Ax,],  where  V.A  denote  the  monotone 

downward  and  upward  roundings  from  the  real  numbers  R  to  S  jll],  then  the  inclusion 

n 

(2.8)  <-„/eii„/  =  X^H',f(-V,) 

t  =  l 

is  guaranteed,  and  the  computation  of  can  be  done  automatically. 

An  automatic,  self-validating  computation  of  the  error  term  (2.5)  requires  an  addi¬ 
tional  ingredient.  This  consists  of  subroutines  for  the  generation  of  the  Taylor  coefficients 
/,,(x,  h)  of  /.  These  use  well-known  recurrence  relations  for  the  arithmetic  operations  and 
library  functions  used  to  evaluate  /(i)  for  given  x  [6],  [15],  [16],  [l9).  A  suitable  compu¬ 
tational  environment  provides  these  routines.  Corliss  and  Chang  [5]  have  shown  that  the 
calculation  of  /o(x,/i),  /i(x,/i),...,/j,(x,/i)  requires  about 


(2.9)  i  =  ap^  ^  bp-h  c 

units  of  time,  where  a  depends  on  the  number  of  multiplications,  divisons,  and  calls  to 
library  functions  in  the  computation  of  /,  and  b  depends  on  the  number  of  additions  and 
subtractions  required.  In  any  case,  interval  evaluation  of  exactly  the  same  recurrence 
relations  yields  the  corresponding  interval  inclusions  Fo{X,  H),...,Fp{X,H)  such  that 

(2.10)  fk(x.h)  eFk{X,H).  A-  =  0. 1. ... ,  p, 

for  all  X  e  and  h  ^  H  [iS],  [I6i.  Thus,  the  desired  interval  inclusion 

(2.11)  cj  €  E„!  =  C,H  ■  F,iX,S) 
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(Corrected  copy] 


ran  I)0  roinpntc'd  autoTnatirally,  given  intervals  II, X  f_  IS  surli  that  <*„  h  h  J1  ^ 

and  |a,/j|  C  A\  (It  is  assuriH'd  thal  a  <  h:  the  contrary  rase  can  he  liandN'd  easily.)  It 
follow's  from  the  above  that 

(2.12)  7/ 6  /?„/  1  r„/ ^ 

a  Ibrinnia  for  aiilonialic.  self* validating  miincTical  (|ijadratur<».  Onto  again,  if  /  is  a  poly¬ 
nomial  o(  dogrre  p  -  1  or  less,  then  7'),(A,  II)  =  1^,0],  so  that  If  €  Unf  ■,  iind  the  width  of 
|c,f/|  depends  only  on  the  roundoll  error  in  the  calculation  of  r„/. 

This  approach  was  the  basis  of  ab  actual  computer  program  19),  which  met  the  accu¬ 
racy  criteria  (1.2),  if  possible,  by  choosing  II  sufficiently  small  |18).  However,  the  problem 
of  enicicncy  was  not  addressed. 

Instead  of  splitting  the  numerical  integration  formula  (2.3)  into  a  rule  of  numerical 
integration  (2.^1)  and  an  error  term  (2.5),  it  will  be  lieljjlul  later  to  consider  it  to  represent 
the  scalar  product  of  the  augmented  fuiic.tion~ value  vector 

(2.13)  f  =  (/(x,),...,/(x„),/,.(^,//)), 
and  the  aijgrncntcd  uiciyht  vector 

(2.1'1)  y/  =  {wi,.  .  .  ,Wn,Cnlt), 

which  is  independent  of  tlic  integrand  f  and  depends  only  on  the  spticific  formula  (2.3) 
used.  Thus, 

(2.15)  7  =  wfeW-F  =  J, 


where 


(2.1G)  ^  F  -  (F(A', ),..., F(A%.),F,,(.Y, 77)) 

and 

(2.17)  W  =  (VV,,...,IT„,C„77) 


arc  the  corresponding  interval  inclusions  of  f,  w.  This  allows  the  computation  to  use  re¬ 
cently  developed  methods  for  highly  accurate  calculation  of  real  and  interval  scalar  prod¬ 
ucts  |l2|.  "^rhis  results  in  a  considerable  decrease  in  widtJi  due  to  roundoff  error  in  the 
computed  value  of  J . 

'Pile  integration  formula  (2.3)  can  be  inter|)reted  as  a  single-panel  rule,  or  as  a  multt- 
}>ancl  rule,  meaning  that  a  simj)ler  formula  on  tn  points  is  applied  k  times  to  the  corre¬ 
sponding  nnmhor  of  subintcrvals  of  A'  =  |a,5i,  with  n  <  km.  Denoting  the  subintervals  of 
A'  by  A',,  i  —  1,2,  ...,/c,  this  means  that  an  integration  formula 


(2.18) 


/  f{T)dT  ~  y2u\jf{xij)  -I-  Cimllt  •  /r(^i,/'«)i 
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holds  in  each  subinterval,  where  =  u,’(X,).  ll  has  been  shown  118'  that 


(2.19) 


1 


w(x,y 

■■  p\  ’ 


'C  c„u.-(A')F('’)(X)  • 


P' 


In  addition  to  the  decrease  in  width  of  Fj,(A'.  le(A'))  by  a  factor  of  w{X)^’  as  t<;(A^)  becomes 
small,  the  width  of  F^’'^X)  will  overestimate  the  width  of  f^^'^X)  by  less  as  w(A')  0 

for  continuous  jl5;.  Thus,  the  gain  in  calculating  the  error  terms  over  smaller 

subintervals  can  be  substantial.  Roundoff  error  in  adding  a  number  of  interval  inclusions 
of  one-panel  rules  (2.18)  can  again  be  reduced  considerably  by  expressing  the  result  as  the 
scalar  product  of  the  extended  augmented  function-value  vector 

(2.20)  F=  {F(Xii.....F{Xt„.),Ff,{Xi,Hi),....F{Xki),...,F{Xkm),FpiXk,Hk)) 
with  the  extended  augmented  weight  vector 

(2.21)  W  = 


3.  Taylor  Series  Methods.  The  seminal  paper  by  Moore  Hi  also  provides  the  basis  for 
self-validating  numerical  integration  by  the  use  of  Taylor  series,  although  the  techniques 
presented  by  him  in  this  case  are  directed  toward  the  solution  of  the  initial-value  prob¬ 
lem  for  ordinary  differential  equations.  For  numerical  integration,  Taylor  series  are  more 
appropriate  than  fixed  quadrature  formulas  for  interval-valued  endpoints  of  intervals  of 
integration,  as  will  be  discussed  in  §6.  Furthermore,  Taylor  series  support  the  rigorous 
approach  to  automatic  recognition  and  treatment  of  singularities  described  in  §5. 

Of  course,  one  could  consider  (1.1)  to  be  the  solution  If  --  y(6)  of  the  initial-value 
problem 


(3.1) 


y'{x)  =  f[x).  y[a)  =  0, 


and  apply  Moore's  methods  directly'.  However,  since  f{x)  in  (3.1)  is  independent  of  y, 
unlike  the  usual  case  in  differential  equations,  it  is  simpler  to  use  the  capability  to  generate 
a  segment  of  the  Taylor  series  and  the  interval  remainder  term  automatically  to  perform 
a  self- validating  calculation  of  the  desired  integral. 

In  particular,  instead  of  expanding  the  solution  y(x)  of  (3.1)  at  x  —  a  as  in  the  case 
of  a  differential  equation,  it  is  advantageous  to  expand  /(x)  at  the  midpoint  c  —  (a  +  6)/2 
of  the  interval  X  =  [a,  6]  of  integration.  It  will  be  assumed  that  the  integrand  /  has  p  >  0 
derivatives  in  the  interval  of  integration.  For  h  =  {b  —  a)  /2,  one  has 


(3.2) 


/(x)  =  m  ^  /'(c)(x  -  c) ..  nc) 


(c) 


(n-  1)1 


/'"'(f) 


(i  -  c)' 


n; 
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caw  rorrifUjU'd  aiJlorfralirnllx;  irfU*rva|s  (  //.  X  •  IS  sii<fi  (hat  ^  h  <  //, 

ainl  aj}\  '  A\  (It  is  assurnod  that  u  h:  ihr  roiiirarv  <  us<‘  can  t><‘  lian<ll<'<l  oasilv\)  It 
Tollows  (rorn  t  in*  above*  lliat 

f2J2)  IJ  >  h\J  r:,,f  :r,r/. 

a  lonruila  for  anloinalie  .  se^lf-validal  inj;  iiuiin’riral  (Hiadriil  ur<‘.  Once'  af^a.iii,  il  /  is  a  [)oly- 
nornial  o(  />  1  or  less*  lhe*ii  /  ^,(A*  II)  ()*()  .  so  i  lia!  / J  -  llj,]’,  and  I.Im*  w^idtli  ol 

\(\(l  d<'|)(*iids  only  on  tin*  ronndoll  error  in  llie‘  eale  ulalion  e)rr7,/. 

'Phis  approadi  was  tlie^  basis  of  an  acliial  rompute'r  proi^rarri  \U\,  whie  li  iix^-  the  aexu- 
rae  y  e  riteria.  (b2).  if  possible  ,  Iw  e  lioosinj;  II  snllie  ie'ntly  small  18i.  lloweweT,  tlie‘  problem 
e>f  edlie  ie'ne  y  was  ne)1  addre'sse*el. 

Instc’ael  ofsplillin^  Ihe  rnime‘riral  inle^f^ralion  fe>rmnla  into  a  rule  ol  nurrKTiral 

inte'^rat-ion  (2*1)  arul  an  (xror  leTtn  (2.r>),  it  will  lx*  ludpfijl  laleT  to  eonsielex  il.  te)  re^pre^sexit 
live*  sealar  product  of  the*  auj^riieuiteHl  fmiriuni-valut  vector 

f  . 

and  Ibe-  au{;me‘nle‘<l  wcKjht  ^'cctor 

(2-14)  w  (w;, . 

wliicfi  is  itid(‘|)(*n(l(*nt  of  the*  inte^grand  /  and  de'pe^nds  only  on  tfie  spe'cific  formula  {2.'>) 
us<*d.  Tims, 

(2*ir.)  /  w-f  W-F  ./* 

whexe* 

(2-lfi)  F  (f  (A*0 . /•  (A'.)*/v(A.y/)) 

anej 

(2.17)  W  (VV, . M.,(Ad/) 

are*  the*  e‘orre*sf)oridiii^  intexval  inclusions  ol  f,  w.  ddiis  allows  the'  < ornputation  to  use*  re'- 
ce'nt.ly  de'vedope'el  me  thods  fe>r  highly  accurate*  cale  ula.tie)n  e)f  real  anel  intexval  scalar  [*re)d- 
ucts  I2l.  4diis  re*sults  in  a  ce)nside'rable'  de*e  re^ase*  in  wielth  clue*  te)  roumleifl  e'rror  in  tlu* 
ce)m[)ut.e'el  va  lue*  e)f  J , 

4di<*  integration  forniula  (2*.'>)  ean  be*  inteT[>r<*t<*el  as  a  .snif/lc- panel  rule\  e>r  as  a  rrtulh- 
panel  rule*,  rne’aning  that  a  simple*r  formula  on  ///  pe>ints  is  a|>|>lie*el  k  time's  to  tfie*  ce)rre'- 
speyneling  number  of  subiiitc'rvals  e)f  A  a.h  .  with  n  km.  l)e*noting  the  s\jbintervaJs  ol 

A  by  A,,  /  1,2,...,/;,  this  means  that  an  integration  le)rmula 

/ 

(2.  IK)  /  f{:r)(lx 

1 1)  '  ^7  7ri  I 
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holds  in  each  subinterval,  where  hi  =  It  has  been  shown  il8i  that 

(2.19)  ^Cimw{Xi)F^f'HXi)  ■  C  c„MX)F^^'>(X)  ■ 

pi  pi 

In  addition  to  the  decrease  in  width  of  Fp(X.  w{X))  by  a  factor  of  tv{X)^’  as  w{X)  becomes 
small,  the  width  of  f'^^’^{X)  will  overestimate  the  width  of  f^^'^X)  by  less  as  u^(A’)  0 

for  continuous  il5|.  Thus,  the  gain  in  calculating  the  error  terms  over  smaller 

subintervals  can  be  substantial.  Roundoff  error  in  adding  a  number  of  interval  inclusions 
of  one-panel  rules  (2.18)  can  again  be  reduced  considerably  by  expressing  the  result  as  the 
scalar  product  of  the  extended  augmented  function-value  vector 

(2.20)  F  =  (F( An. .  •  • ,  F( Ai^),  Fp( Ai,  I7i), . . . ,  F(Ah),  •  •  • ,  F( Afc^),  F„( A^,  Hk)) 

with  the  extended  augmented  weight  vector 

(2.21)  W  =  ii,  .  .  .  ,  W  ifYiH  \ ,  .  .  .  ,  W  kl  •  ■  •  •  i krri':  ^  kmH  k'j . 


3.  Taylor  Series  Methods.  The  seminal  paper  by  Moore  iHi  also  provides  the  basis  for 
self-validating  numerical  integration  by  the  use  of  Taylor  series,  although  the  techniques 
presented  by  him  in  this  case  are  directed  toward  the  solution  of  the  initial-value  prob¬ 
lem  for  ordinary  differential  equations.  For  numerical  integration,  Taylor  series  are  more 
appropriate  than  fixed  quadrature  formulas  for  interval-valued  endpoints  of  intervals  of 
integration,  as  will  be  discussed  in  §6.  Furthermore,  Taylor  series  support  the  rigorous 
approach  to  automatic  recognition  and  treatment  of  singularities  described  in  §5. 

Of  course,  one  could  consider  (1.1)  to  be  the  solution  If  =  y{b)  of  the  initial-value 
problem 


(3.1) 


y'{x}  ^  f{x),  y{a)  =  0, 


and  apply  Moore's  methods  directly.  However,  since  f{x)  in  (3.1)  is  independent  of  y, 
unlike  the  usual  case  in  differential  equations,  it  is  simpler  to  use  the  capability  to  generate 
a  segment  of  the  Taylor  series  and  the  interval  remainder  term  automatically  to  perform 
a  self-validating  calculation  of  the  desired  integral. 

In  particular,  instead  of  expanding  the  solution  y(x)  of  (3.1)  at  x  =  n  as  in  the  case 
of  a  differential  equation,  it  is  advantageous  to  expand  /(x)  at  the  midpoint  c  ^  (a  +  b)/2 
of  the  interval  X  —  [a,  6]  of  integration.  It  will  be  assumed  that  the  integrand  /  has  p  >  0 
derivatives  in  the  interval  of  integration.  For  h  ^  [b  —  a)/2,  one  has 


(3.2) 


f{x)  =  f{c)^f'{c){x-c)  +  r{c) 


n—  1 


(n-l)! 
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[Corrected  Copy] 


for  71  <  wlioro  ^  f  A‘  is  botwot'n  c  an<]  x.  Lot  Le  an  inlorval  inclusion  of  on 

A’.  Then,  6  which  is  an  interval-valued  constant.  Kroin  (3.2), 


(3.3)  fix)  €  /(<•)  +  r^c){x  -  r)  +  • .  •  +  /*"-  "(c)  ~ 

l>et 


(3.1) 


g(x)  ^  f{c){T  -  c) Ac)'  -.j. 


I  ...4  /l'-■)(c)^^..-.l^ 


111 


be  an  indefinite  integral  of  the  Taylor  polynomial  of  degree  n  —  1  of  /(x).  Theni 

,  (x  _  +  i  it 

(rO  f  / 


(3.5) 

where 


f  f{x)dxeg{x)  -|-F^"‘^(A') 
Ja  “ 


(n+  1)! 


QJn, 


n-  1 


^  A-(C,— 1. 


1^.0 

i  even 


(3.6) 


•L 


(n+  ])! 

jjn+l 

(u-f  1)!’ 


(n  +  l)! 


,  for  n  odd, 
for  77  even. 


Note  tliat  subtraction  does  not  “cancel”  equal  intervals  in  general.  One  has  [u,  e|  —  [u,  u]  = 
|u  —  —  u]  /  |0,0)  unless  u  —  i>,  in  which  case  the  interval  consists  of  a  single  point  |t), 

|ir>j,  |1G).  If  the  series  were  expanded  at  x  =  a  instead  of  x  =  c,  then  the  width  of  the 
interval  remainder  terms  would  be  increased  by  a  factor  of 

Formuhis  (3.5)-(3.C)  resemble  (2.12)  for  ordinary  quadrature  rules,  with  the  evaluation 
of  the  integrand  at  n  ]>oints  replaced  by  its  value  and  the  values  of  its  first  n  —  J  derivatives 
at  a  single  poinl.  If  /  is  a  polynomial  of  degree  n  -  1  or  less,  then  =  [0,0],  and 

only  roundoff  error  effects  the  width  of 

J„  can  be  computed  directly  from  the  automatically  generated  interval  Taylor  coeffi¬ 
cients  Fk{C,  II)  and  Fk{X,  77)  of  /,  A:  =  0, 1, . . . ,  n  <  p. 

Since 


(3.7)  f  77  -  0,  1,...,P, 

the  intcrscclion  principle  |8],  |y)  can  be  u.sed.  One  has 

(3.8) 


1/  s  n  ■' 

n  -O 
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This  intersection  can  be  calculated  as  the  corresponding  interval  Taylor  coefficients  of  the 
integrand  are  generated: 


j  /,,  =  Jn-  (a  Rieniann  sum). 

(3.9) 

(  ^  ri  “  1  fl  ^  ~  . P' 

This  provides  a  means  to  determine  the  highest  useful  term  of  the  Taylor  expansion,  since 
the  calculation  can  be  terminated  when  effective  decrease  in  the  w'idths  of  the  intervals 
{In}  ceases,  or  when  the  desired  tolerance  is  met. 

As  in  the  ca.se  of  (2.12),  formula  (3.6)  can  be  evaluated  as  the  scalar  product  of  the 
vectors 

(3.10)  F  =  (Fo(C,  H),  Fi(C, .«),...,  ,  (C.  H).  T„(C,  H)), 

where  7’n(C.  II)  ^  Fr,{C.Il)  -  Fn{C,H)  or  Tn{C.Il)  -  2Fn{C.H)  according  as  n  is  odd 
or  even,  and  the  vector 

(3.11)  W--= 
where 

(3.12)  kr.0A . n. 

A:  ~t-  1 

The  width  of  J„  can  al.so  be  reduced  somewhat  if  the  expansion  is  about  C  -  jc.c.. 
c  6  S.  In  this  rase,  the  interval  stepsize  H  might  have  to  be  increased  a  very  small 
amount  to  maintain  inclusion  of  the  subinterval  of  integration.  In  §5,  it  will  be  noted  that 
expan.sion  about  an  endpoint,  as  in  (3.1),  or  some  other  point  in  the  interval  of  integration, 
can  be  helpful  if  the  integrand  has  removable  singularities. 

4.  Adaptive  Strategies.  The  computer  program  INTE  [9]  demonstrated  the  reality 
of  a\Jlomalic.  self-validating  numerical  quadrature  using  Newton-Cotes  and  Gaussian  in¬ 
tegration  formulas  in  the  way  discussed  in  §2.  (Euler-Maclaurin  integration  was  added 
lo  the  capabilities  of  INTE  later  llOI.)  However,  there  was  no  attempt  to  address  the 
problem  of  efficienci,  a  defect  remedied  in  the  program  described  later.  In  order  lo  satisfy 
the  accuracy  criterion  (1.2),  if  possible,  INTE  simply  divided  the.  interval  of  integration 
into  a  sufficient  number  of  equal  subintervals  [Qj,  ;18..  By  contrast,  popular  numerical 
integration  packages  such  as  CADRE  iS;  and  QUADP.4CK  :17i  use  information  obtained 
about  the  behavior  of  the  integrand  to  attempt  to  reduce  the  number  of  function  evalua¬ 
tions  to  a  minimum.  The  method  given  here  uses  similar  strategies,  except  that  estimates 
of  the  error  based  on  evaluation  of  the  integrand  at  a  finite  set  of  points  are  replaced  by 
guaranteed  bounds.  This  eliminates  the  need  for  "safety  factors”  j23i. 

Adaptive  .strategies  fall  into  the  categories  of  order  adaptation,  which  relates  to  the 
choice  of  the  formula  used  in  each  subinterval,  and  subinterval  adaptation,  which  de¬ 
termines  how  the  original  interval  of  integration  is  broken  up  into  subintervals.  Order 
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Ibr  Ji  ]),  wh('r(‘  f  ’  A  is  l>olw(MMi  r  aiifl  s.  bc^  au  inl<Tval  inrlusioD  of 

A'.  'PluMi,  ‘  /'  ).  whicli  is  an  iniiTval-valnrd  constant.  From 


(■•{■••i)  fi^r)  ,  f(r)  ■  f'(<){:r  c) 


/'"  'Hr) 


h>  ))! 


/,-(r,)(  V) 


Hi-r)  r)  ■  f'{r) 


an  indefinite'  inte'j^ral  of  tlu'  'Pay lor  polynomial  of  dof^rc'e'  ri  I  of  f(x).  "Phen, 

f^‘  l/r  (  j-  ..U'  I  I  1^- 

(:■•-'■■)  /  9(j-)|  I  .  .1,,. 

./„  '<■  [n  I  1):  i-i 


{t  i  1)! 


(n  I  I)! 


(n  II)! 


y/(rO(A') 


(7/  !  I)! 


for  n  odd, 


for  7/  ('V('n. 


Note'  that  subtrae  tion  dex's  not  ''c.ane  e'P'  ('<pjal  inte'rvaJs  in  ^;('ne'rab  One'  has  j7/,7>| 

\u  v,  v  u\  /  |()A);  tinle^ss  u  v,  in  wliich  ease'  the'  int.e'rval  e’e)i)sists  of  a  siiif^le'  peiint  |l  j, 
115,,  \('}\.  If  the'  se'rie'S  we're'  e'Xpanele'el  at  x  a  inste'ae)  of  x  c,  t.lic'ii  the'  width  e)f  tfie* 

inte'rval  re'inainde'r  le'rms  would  f)e'  ineTe'ase'd  f)V  a  lari, or  ol  2"  ’ 

Ix)rmula,s  (.'br))^(;{,(i)  re'se'irible'  (2.12)  [e)r  oreliiiary  epiadrature'  rule's,  w'itli  the' e'va.liJa.tie)n 
of  t  he'  int.e‘j;raiie|  at  7/  |)oiiil,s  re'pla.ce'el  by  its  value'  ariel  t  he'  value's  e)r  its  first*  n  I  d('ri vat.ive*s 
at  a  sinf»le'  pe)int,-  If  /  is  a  pe)lyne)mial  e>f  eU'gre'e'  7/  I  or  le'ss,  the'n  /’’^^'^(A")  |0,()|,  anel 

euily  re)une]e)lf  e'rre)r  ('ffe'cts  the’  wielth  e)f  7„, 

Jy,  can  be'  ce)m()Ute‘ei  dire'cMy  from  the'  aute)rnat.ically  ge'ne'rate'fi  irite'rval  'Taylor  coeffi- 
eie'nts  II)  and  h\{\\  II)  e)f  f.h  0,  1,. . .  ,7/  p. 

Sinee* 


If  /  f(x)dx  7,..  7/  0.1,...,  p, 


the'  i7it crstctioii  priiuiplc  IS,,  ,9  can  lie'  use'd.  Oiu’  lias 


//■  H''" 
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This  intersection  can  be  calculated  as  the  corresponding  interval  Taylor  coefficients  of  the 
integrand  are  generated: 


(a  Riemann  sum), 

n  =  1.2.  ....p. 

This  provides  a  means  to  determine  the  highest  useful  term  of  the  Taylor  expansion,  since 
the  calculation  can  be  terminated  when  effective  decrease  in  the  wddths  of  the  intervals 
{7^}  ceases,  or  when  the  desired  tolerance  is  met. 

As  in  the  case  of  (2.12),  formula  (3.6)  can  be  evaluated  as  the  scalar  product  of  the 
vectors 

(3.10)  F=  (Fo(C',^),Fi(C,77),...,F,_i(C.i/).T,(C,//)), 

where  Tn[C,H)  =  F-n[C,H)  —  Fn{C,H)  or  Tn{C^H)  -  2F„{C,H)  according  as  n  is  odd 
or  even,  and  the  vector 

(3.11)  W:- 
where 

(3.12)  Wk^-r^,  fc  =  0,l,...,n. 

k  -t  1 

The  width  of  ,/„  can  also  be  reduced  somewhat  if  the  expansion  is  about  C  —  jc,c:. 
c  c  S.  In  this  case,  the  interval  stepsize  H  might  have  to  be  increased  a  very  small 
amount  to  maintain  inclusion  of  the  subinterval  of  integration.  In  §5,  it  will  be  noted  that 
expansion  about  an  endpoint,  as  in  (3.1),  or  some  other  point  in  the  interval  of  integration, 
can  be  helpful  if  the  integrand  has  removable  singularities. 

4.  Adaptive  Strategies.  The  computer  program  INTE  [9]  demonstrated  the  reality 
of  automatic,  self-validating  numerical  quadrature  using  Newton-Cotes  and  Gaussian  in¬ 
tegration  formulas  in  the  way  discussed  in  §2.  (Euler-Maclaurin  integration  was  added 
to  the  capabilities  of  INTE  later  [10].)  However,  there  was  no  attempt  to  address  the 
problem  of  efficiency,  a  defect  remedied  in  the  program  described  later.  In  order  to  satisfy 
the  accuracy  criterion  (1.2),  if  possible,  INTE  simply  divided  the  interval  of  integration 
into  a  sufficient  number  of  equal  subintervals  [Qj,  il8:.  By  contrast,  popular  numerical 
integration  packages  such  as  CADRE  |3]  and  QUADPACK  llTj  use  information  obtained 
about  the  behavior  of  the  integrand  to  attempt  to  reduce  the  number  of  function  evalua¬ 
tions  to  a  minimum.  The  method  given  here  uses  similar  strategies,  except  that  estimates 
of  the  error  based  on  evaluation  of  the  integrand  at  a  finite  set  of  points  are  replaced  by 
guaranteed  bounds.  This  eliminates  the  need  for  “safety  factors”  |23j. 

Adaptive  strategies  fall  into  the  categories  of  order  adaptation,  which  relates  to  the 
choice  of  the  formula  used  in  each  subinterval,  and  subinterval  adaptation,  which  de¬ 
termines  how  the  original  interval  of  integration  is  broken  up  into  subintervals.  Order 
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adaptaiion  is  somewhat  simpler  than  subinterval  adaptation*  and  will  be  considered  first. 
For  methods  based  either  on  standard  quadrature  formulas  or  Taylor  series,  order  zero 
refers  to  the  interval  Riemann  sum  f  (A")  •  u^(A"),  which  always  contains  f{x)dx  i4]. 

Given  a  suite  of  numerical  integration  formulas  of  the  form  (2,3).  suppose  that  A"  t  IS 
is  the  current  subinterval  of  integration.  Specifically,  suppose  that  the  given  rules  are  of 

order  ?  for  z  —  0,  J . k  on  points.  Once  the  interval  Taylor  coefficients  Ft(X.H)  have 

been  formed*  then  the  order  p  <  k  of  the  most  accurate  rule  can  be  chosen  to  be  the  value 
of  1  for  which  the  width  of  the  error  term 

(4.1) 

is  minimum,  ?  =  0, 1, . . . ,  /c. 

Alternatively,  the  actual  approximate  integrals  could  be  examined,  but  this  requires 
more  computation  than  use  of  the  error  terms  alone.  Suppose  that  p  denotes  the  maximum 
value  of  the  width  uj(i^(C))  of  the  interval  evaluation  of  /  at  a  node  C  of  the  integration 
formula  being  used.  The  total  cost  can  be  taken  to  be  proportional  to  Ui  •  w{Ji),  where 
is  the  number  of  function  evaluations.  The  width  u;(J,:)  can  be  estimated  by  // -h  u;(£'^./), 
and  thus  i  can  be  chosen  as  the  minimizer  of 

(4.2)  m.{i)  =  in\n{w{Jo),ni{fi  ~  w(En,f)\- 

Thus,  more  nodes  are  used  in  a  given  subinterval  only  if  a  significant  reduction  in  width 
of  the  approximate  integral  results.  It  should  also  be  noted  that  a  given  integration  rule 
can  be  used  in  several  integration  formulas  having  remainder  terms  of  different  orders. 
For  example,  Stroud  and  Secrest  |24-  give  error  terms  for  Gaussian  integration  rules  on  n 
nodes  which  have  orders  1  <  P  ^  2n.  In  certain  cases,  the  error  terms  corresponding  to 
smaller  than  maximum  p  can  be  narrower  (for  example,  for  highly  oscillatory  integrands), 
and  the  use  of  these  formulas  instead  of  the  standard  ones  will  minimize  u'(J„). 

In  the  case  of  Taylor  series,  the  intersection  (3.9)  procedure  can  result  in  approxima¬ 
tions  In  which  are  considerably  better  than  J„  given  by  (3.6),  which  can  be  viewed  as  the 
sum  of  a  rule  and  an  error  term.  The  intervals  are  monotone  decreasing  in  width,  so 
the  increase  in  accuracy  has  to  be  balanced  against  the  cost  in  time  (2.9)  of  generating 
more  terms  of  the  series.  The  constants  «.,6,  c  in  (2.9)  can  always  be  determined  for  a 
given  integrand,  so  the  corresponding  heuristic  can  be  based  on  the  function 

(4.3)  6{n)  ~  w;(/n)  •  [an^  -i-  bn  -r  c). 

Generation  of  the  Taylor  series  can  be  stopped  when 

(*)  ^n  —  2  ^n—1 

(ii)  d{n  —  2)  <  0(n),  or 

(iii)  n  —  p. 

Because  of  the  difference  between  the  remainder  terms  in  (3.6)  for  odd  and  even  n,  it 
is  prudent  to  calculate  at  least  one  extra  value  of  J„.  However,  the  conditions  above 
guarantee  that  no  more  than  two  series  terms  will  be  computed  beyond  the  one  which 
gives  the  narrowest  I„. 
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Another  important  strategy  for  order  adaptation  involves  the  case  that  the  integrand 
has  singularities  in  a  certain  derivative  in  the  given  subinterval.  This  will  be  discussed 
more  fully  in  the  next  section.  If  a  certain  derivative  cannot  be  evaluated,  this  will  be 
detected,  and  the  method  will  be  restricted  to  rules  or  orders  of  Taylor  expansion  which 
use  only  the  derivatives  which  can  be  evaluated. 

The  strategy  for  subinterval  adaptation  retains  all  subintervals.  At  each  step,  the 
subinterval  which  makes  the  largest  contribution  to  the  width  of  J  is  processed  by  breaking 
it  into  further  subintervals.  This  processing  continues  until 

(i)  w{J)  is  small  enough  to  satisfy  the  accuracy  requirement  (1.2), 

(ii)  the  noise  inherent  in  function  evaluation  limits  further  reduction  of  ^{J),  or 

(iii)  more  than  the  maximum  number  M  of  function  evaluations  have  been  performed. 

The  second  termination  criterion  in  this  list  is  particularly  important.  If  the  noise  in  the 
function  evaluation  is  large  relative  to  the  accuracy  requested,  eventually  the  width  of  the 
truncation  error  Ef  is  made  so  small  that  it  adds  nothing  to  the  width  of  the  rule  Rf 
alone.  At  this  point,  further  increase  in  accuracy  is  not  possible.  Without  the  guaranteed 
bounds  provided  by  the  interval  computation,  many  standard  methods  cannot  recognize 
when  this  point  has  been  reached,  and  the  calculation  should  be  terminated.  Malcolm 
and  Simpson  113;  observe  that  the  strategy  of  processing  the  worst  subinterval  results 
in  local  errors  of  roughly  equal  magnitude.  Rice  [23]  calls  this  an  ordered  list  interval 
collection  management  component,  and  lists  some  advantages  and  disadvantages  which 
will  be  discussed  in  more  detail  in  §7. 

5.  Treatment  of  Singularities.  Among  the  quadrature  routines  which  provide  estimates 
for  If.  the  more  successful  ones  have  special  provisions  to  handle  and  perhaps  recognize 
certain  types  of  singularities.  QUADPACK  jlT],  for  example,  can  handle  integrals  of  the 
form 


(5.1) 


—  x)^^v[x)f{x)dx, 


where  a,d  >  -1  and  i;(x)  =  1,  log(a:  -  a),  log(6  -  r).  or  log(x  —  a)  log(6  -  x).  provided 
the  user  supplies  o,  jd.  and  the  form  of  v.  CADRE  [3]  attempts  to  detect  and  verify  the 
presence  of  jump  discontinuities  or  ar'^-type  singularities  in  the  integrand.  The  program 
described  here  attempts  to  recognize  and  handle  automatically 


(1)  singularities  in  derivatives  of  /, 

(2)  some  removable  singularities, 

(3)  jump  discontinuities,  and 

(4)  some  algebraic  singularities  at  endpoints. 

Since  guaranteed  bounds  are  computed  for  1  f .  the  task  of  such  a  program  is  more 
difficult  than  for  methods  which  yield  only  an  estimate.  For  example.  Gaussian  quadrature 
can  be  used  in  the  neighborhood  of  an  endpoint  singularity,  because  the  integrand  need 
not  be  evaluated  at  the  endpoint.  To  give  guaranteed  bounds,  however,  requires  that  the 
function,  or  some  of  its  derivatives,  be  evaluated  on  the  entire  interval.  Hence,  the  price 
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of  the  guarantee  is  a  restriction  on  the  applicability  of  the  program.  Either  it  verifies  that 
the  integrand  is  in  its  domain  of  applicability,  or.  if  it  cannot  deliver  guaranteed  bounds, 
it  notifies  the  user  with  an  indication  of  the  difficulty. 

An  example  will  indicate  what  can  go  WTong  at  endpoints.  For  example, 


could  be  presented  to  the  computer  as 


I'jf  =  I  An]  ~  X  dx. 

Jo 

Although  the  original  problem  (5.2)  is  well-behaved,  the  interval  integral  in  (5.3)  contains 
integrals  such  as 


(5.3) 


(.5.4) 


\'Vn  —  X  dx. 


and  consequently  does  not  exist.  Hence,  the  correct  response  is  that  (5.3)  cannot  be 
evaluated  on  the  entire  interval  of  integration,  just  on  iO,  Vtt;,  for  example.  This  suggests 
that  standard  numerical  quadrature  routines,  which  avoid  evaluation  of  the  integrand  at 
endpoints,  can  be  fooled  into  returning  values  for  integrals  such  as 

(5.5)  VTi"  -  (  -  X  dx 

for  £  sufficiently  small,  instead  of  informing  the  user  of  possible  difficulty.  The  types  of 
singularities  which  can  be  handled  by  the  techniques  presented  here  will  now  be  described. 

5.1.  Singularities  in  derivatives  of  /. 

The  integrand  /  may  not  have  enough  derivatives  for  some  rules  which  the  integration 
program  can  apply.  In  the  process  of  automatic  generation  of  interval  Taylor  coefficients 
on  an  interval  X.  the  nonexistence  of  a  derivatives  of  /  beyond  a  certain  order  is  detected 
and  reported.  As  long  as  /  itself  can  be  evaluated  on  the  entire  interval  of  integration, 
the  order  adaptation  strategy  handles  singularities  in  the  derivatives  of  /.  For  example, 
consider  the  problem 


(5.6) 


16 

3 


The  first  derivative  /'  is  undefined  at  x  =  0,  so  the  only 
entire  interval  of  integration  is  the  Riemann  sum 


(5.7) 


f 

Jo 


\^dx  G 


mm  v'X.  max  \  x 

■0.4  if.'. 4 


rule  that  can  be  applied  to  the 


10.8  . 
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At  this  point  the  subinterval  adaptive  strategy  takes  over.  At  each  step,  the  subinterval 
containing  0  requires  a  Riemann  sum.  and  thus  is  frequently  selected  for  further  processing 
because  of  its  width.  All  other  subinlervals  can  be  processed  using  higher-order  rules.  This 
strategy  applies  to  singularities  in  f  which  occur  anywhere  in  the  interval  of  integration, 
and  not  just  at  endpoints.  It  also  works  for  singularities  in  higher  derivatives,  whose 
presence  might  not  even  be  known  to  the  user.  Once  a  singularity  of  this  type  has  been 
confined  to  a  stafficiently  small  subinterval,  it  is  possible  to  meet  reasonable  requirements 
for  accuracy,  with  self- validation. 

5.2.  Jump  discontinuities 

If  the  integrand  is  given  by  an  ordinary  mathematical  expression,  then  it  is  difficult  to 
represent  a  function  which  has  jump  discontinuities,  and  yet  can  be  evaluated  at  every  point 
in  the  interval  of  integration.  However,  the  user  can  supply  a  subroutine  for  evaluation 
of  the  integrand  which  produces  jump  discontinuities.  These  appear  to  the  program  as 
singularities  in  f.  On  any  subinterval  which  contains  a  jump,  only  the  Riemann  sum 
is  available,  and  its  width  will  lead  to  frequent  selection  of  this  subinterval  for  further 
processing,  as  before.  No  special  algorithms  are  needed  in  this  case. 

This  behavior  is  similar  to  CADRE  ib2,.  Upon  recognizing  a  jump.  CADRE  subdi¬ 
vides  the  interval  and  uses  a  low-order  rule. 

5.3.  Some  removable  singularities 

The  Taylor  series  method  permits  handling  of  some  removable  singularities.  Consider 
the  problem 


(5.8) 


sinx 
- dx, 

X 


in  which  the  integrand  has  a  removable  singularity  at  x  =  0.  In  this  case,  the  integrand 
cannot  be  evaluated  directly  on  any  interval  containing  0,  but  /  can  be  expanded  at  x  =  0 
using  I'Hospital's  rule,  which  can  be  applied  automatically  ibj.  A  short  Taylor  series  with 
remainder  for  /  at  0  is  sufficient  to  bound  If  near  0,  and  the  rest  of  the  interval  of 
integration  is  processed  in  the  normal  manner. 

5.4.  Some  algebraic  singularities  at  endpoints 
Suppose  that  the  integrand  has  the  form 


(5.9) 


/(x)  =  (a-x)  '>(x). 


where  4>{x)  is  analytic  at  x  =  a.  If  c  is  chosen  closer  to  a  than  any  other  singularity,  then 
the  series  for  /  expanded  at  x  =  o  is  asymptotic  to  the  series  for  v{x)  =  (a  —  x)“*.  The 
Taylor  coefficients  t',  =  Vi[a.h)  of  v  satisfy  the  recurrence  relation 


5  —  1  \  h 


(5.10) 


V  t',  Rc  ' 


768 


where  Rc  is  the  radius  of  convergence  of  the  series.  If  /  cannot  be  evaluated  on  [a.b[.  then 
one  attempts  to  find  constants  A'l.  Kp.  sp.  sp  such  that 

(5.11)  Ki(a  -  <  /(j-)  <  Kp{a  - 

for  j  near  a.  If  such  constants  can  be  found  and  (5.11)  validated,  then  we  proceed.  If 
Si  >  1.  then  the  program  can  guarantee  that  If  docs  not  exist.  We  know  of  no  other 
numerical  quadrature  routine  which  can  validate,  nonexistence  of  an  integral.  If  s;?  <  1, 
then 

i 

(5.12)  --^^(o  -  o  V""'-  <  /  f{x)dx  S  . 

1  -  -Sl  Ja  1  " 

These  bounds  can  be  made  as  tight  as  desired  by  taking  a'  close  enough  to  o.  The  interval 
la,  a']  is  placed  on  the  list  of  subintervals,  and  processing  continues  on  the  subinterval 

ia',6i. 

A  singularity  at  b  can  be  handled  similarly. 

If  A'l,  Kp,  si,  sp  cannot  be  found,  or  if 

(5.13)  -Sl  <  1  <  sjf. 

then  the  integrand  cannot  be  handled  by  this  method,  and  a  message  to  that  effect  is  sent 
to  the  user. 

These  methods  for  recognizing  and  handling  certain  singularities  extend  the  domain 
of  applicability  of  the  program  to  include  many  integrands  which  arise  in  applications. 
However,  their  usefulness  is  somewhat  limited.  For  one  thing,  the  location  of  the  singu¬ 
larity  must  be  known  in  advance,  or  guessed.  For  popular  sets  of  test  problems,  guessing 
one  or  both  endpoints  usually  works.  For  real  problems,  locations  of  singularities  may  be 
unknown.  However,  the  method  given  here  validates  correct  guesses.  The  endpoints  to  be 
investigated  for  the  presence  of  singularities  must  be  machine  numbers,  not  intervals.  If 
an  interval-valued  endpoint  is  even  one  machine  number  wide,  then  the  problem  contains 
integrands  which  are  unbounded  on  a  set  of  positive  measure.  If  a  singularity  is  in  the 
interior  of  the  interval  of  integration,  then  we  can  determine  its  location  to  within  one  or 
two  machine  numbers.  From  this,  its  contribution  to  7/  could  be  estimated,  but  the  possi¬ 
bility  that  the  integrand  is  unbounded  on  a  set  of  positive  measure  cannot  be  eliminated. 
Since  a  validated  answer  cannot  be  produced  in  this  case,  an  error  return  is  selected.  If 
the  user  knows  that  the  singularity  occurs  at  a  machine  number,  then  the  integral  should 
be  calculated  as  the  sum  of  two  integrals,  with  the  singularity  at  one  endpoint  of  each. 

6.  Extension  to  Interval  Values.  The  definition  of  the  integral  (l-l)  has  been  extended 
to  arbitrary  interv^al-valued  integrands  /  i4i,  (20;.  For  smooth,  real-valued  functions  such 
as  those  considered  in  §2,  the  interval  integral  and  the  Riemann  integral  coincide.  The 
concept  of  interval  integration  is  useful  in  connection  with  integrands  which  have  jump 
discontinuities  or  singularities  of  various  kinds.  The  definition  given  in  |4]  can  be  extended 
to  the  case  that  the  limits  of  integration  are  interval-valued: 


f{x)dx  I  a  £  .4,  b  ^  B 
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for  A,B  ^  IR,  the  set  of  all  finite  intervals  with  real  endpoints. 

There  are  several  reasons  to  want  to  be  able  handle  interval  endpoints  of  integration. 
First  of  all,  some  real  numbers,  such  as  tt,  cannot  be  represented  exactly  in  S,  and  have 
to  be  replaced  by  the  corresponding  small  intervals  such  as  |V7r,  Aa]  in  IS.  Secondly,  the 
limits  of  integration  may  corne  from  measurements  other  estimates,  and  thus  are  known 
to  lie  between  certain  limits  even  though  their  exact  values  are  uncertain.  Finally,  one  can 
be  interested  in  bounds  for  the  value  of  the  integral  (1.1)  over  ranges  of  values  of  limits 
of  integration.  In  this  case,  rather  than  satisfy  an  accuracy  criterion  such  as  (1.2),  it  is 
usually  desired  to  find  an  interval  J  containing  (6.1)  which  is  as  small  as  possible. 

Similar  considerations  apply  to  interval-valued  integrands.  The  case  which  usually 
arises  in  practice  is  that  the  integrand  /  is  a  function  not  only  of  the  independent  variable 
X,  but  also  several  parameters  C],C2, . . . ,  For  example,  /  could  be  a  polynomial  of 
degree  m  —  1  with  coefficients  determined  by  observations, 

(6.2)  /(x)  =  Cj -f  C2X -I - C,  e  IS,  *  =  1,2,. . .  ,m. 


In  general,  given  intervals  Ci,C2,. . .  it  is  natural  to  define 


(6.3) 


_/^(x,  Cj  ,  .  .  .  ,  Crn}  —  ^1  ^  C'i,...,Cm  £  C'm}  • 


The  natural  interval  inclusions  Fk{X,Ci, . . .  ,Cm)  of  /  and  its  Taylor  coefficients  on  an 
interval  X  €  IS  are  again  obtainable  on  a  computer  by  using  interval  computation  and 
automatic  differentiation.  In  particular,  for  an  interval  polynomial  (6.2),  Fp{X,H)  =  |0,0] 
for  P  ^  j^st  as  in  the  real  case.  The  definition 


(6.4) 


Cl 


€  Cl, . ..  ,Cm 


describes  the  type  of  integrals  to  which  the  methods  of  this  paper  apply.  It  is  assumed  that 
yl,  B  and  the  coefficient  intervals  Ci  all  belong  to  IS,  and  hence  are  machine-representable. 
If  necessary,  outward  rounding  can  be  used  to  obtain  them  from  real  intervals,  preserving 
inclusion  of  the  desired  integral. 

On  the  basis  of  (2.3),  it  is  to  be  expected  that  the  best  results  will  be  obtained  for 
integrands  /  which  are  very  smooth  as  functions  of  x.  However,  one  can  always  compute 
the  interval  Riemann  sum  F[X)  •  u;(X).  This  is  self-validating,  because 


(6.5) 


L 


f{x)dxC  F{X)-w{X), 


but  is  inaccurate  and  slow  to  converge  [4],  [20].  It  is  important  to  be  able  to  confine  bad 
behavior  of  the  integrand  to  very  small  intervals  for  (6.5)  to  be  useful. 

The  meaning  of  tolerance  for  problems  with  interval-valued  endpoints  requires  some 
clarification.  Consider  the  problem 


(6.6) 
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If  we  compute  J  —  0.004.0.791,  for  example,  then  u'(J)  0.794,  although  the  estimate 

for  each  endpoint  is  in  error  by  less  than  0.005.  Hence,  a  requested  tolerance  must  be  large 
enough  to  accomodate  the  uncertainties  which  are  inherent  in  the  problem  being  solved. 

The  cases  that  one  or  both  endpoints  of  integration  are  nondegenerate  intervals  in 
IS  will  now  be  considered.  The  possible  situations  will  be  denoted  by  IR,  Rl,  and  II, 
respectively,  according  as  the  lower,  upper,  or  both  endpoints  of  the  interval  of  integration 
are  interval-valued.  One  strategy  for  handling  interval-valued  endpoints  is  to  allow  the 
uncertainties  in  the  locations  of  the  endpoints  to  be  carried  over  into  uncertainties  in  the 
locations  of  the  nodes,  as  in  INTE  i9;.  For  example,  using  a  one-panel  Simpson’s  rule  on 
4-decimal  digit  machine  gives 


(6.7) 


[B-A) 

6 


(F(i0,0.1]  +  4F’([  1.55, 1.65])  +  F’(l3.1,3.2)) 


2880 


F‘"(!0,3.2]), 


where  A  =  lO.O.l],  B  =  !3. 1,3.2].  For  f(x)  =  sin  x,  (6.7)  yields  |l. 880,2.214]  using  the 
accurate  scalar  product,  while  the  correct  answer  is  [1.994,2.000]. 

A  better  strategy  is  to  concentrate  the  uncertainty  at  the  ends: 


rOA  rZ.i  /-IS. 1,3. 2] 

(6.8)  /  f{x)dx^  /  f{x)dx^  /  f{x)dx^  /  f{x)dx. 

The  middle  part  is  handled  as  an  RR  integral  as  described  in  §§3-4,  while  the  other  two 
integrals,  of  the  types  IR  and  RI,  respectively,  will  be  treated  in  the  manner  to  be  described 
below.  In  this  example,  we  can  get  [1.984,2.073].  As  the  widths  of  the  endpoints  increase, 
the  advantage  of  the  second  strategy  becomes  more  pronounced. 

The  general  case  (6.3)  can  be  expressed  in  terms  of  integrals  of  the  above  types.  This 
case  in  turn  splits  into  six  subcases,  according  as  A  and  B  are  (i)  disjoint,  (ii)  overlapping, 
or  (iii)  one  is  contained  in  the  other.  More  precisely,  for  A  =  [Ai^Aji]^  B  = 
suppose  that  <  Bp.  Then  we  have: 

Case  1.  AR  <  BL  (disjoint  interval  endpoints).  Here,  the  integral  (6.3)  can  be  written 

rB  r  Ar  rB^  f\Bz,,Bj-!] 

(6.9)  /  f{x)dx=  /  f{x)dx+  1  /(x)di-r  /  f{i)dx, 

JA  J\AL,Ar,]  J  Ar,  JBi 


and  thus  is  the  sum  of  integrals  of  types  IR,  RR,  and  Rl,  respectively. 

Case  2.  AL  <  BL  <  AR  <  BR  (overlapping  interval  endpoints).  Here, 


(6.10) 


rB  rBi  r  r\Aji,Bi,] 

/  /(z)dx=  /  f{x)dx-^  /  /(x)dx-  /  /(x)dx, 

Ja  J\Al..Bl]  J\Bl,Ai/  JAr 


the  sum  of  integrals  of  types  IR.  II.  and  Rl. 
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Case  3.  BL  <  AL  <  AR  <  BR  (A  is  properly  contained  in  B).  Here. 

r\AL,Art\  p\At;,Bt<\ 

(6-11)  /  /(a-)dx  =  -  /  f{x)dx-  /  f{x)dx-r  /  f{x)dx, 

again  the  sum  of  integrals  of  types  IR.  11.  and  Rl. 

The  other  three  cases  are  obtained  for  B^  Afi  by  reversing  the  roles  of  A  and  B  in  the 
preceding. 

Case  1  is  undoubted h'  the  one  which  is  encountered  most  often  in  practice.  To  illus¬ 
trate  Case  3,  consider 

/•|0,5]  /.2  /•!2,3]  riSiS] 

(6.12)  /  f{x)dx--  /  /(i)dx-r  /  /(x)dx+  /  /(x)dx. 

•fi2,3i  ./[0,2i  •/;2.3:  Jz 

We  are  not  aware  of  physical  problems  which  give  rise  to  integration  problems  other  than 
Case  1  (except  to  bound  the  values  of  integrals  over  ranges  of  endpoints),  but  provision 
for  these  cases  adds  little  to  the  machinery  which  is  required  to  handle  Case  1. 

Integrals  of  types  RI,  IR.  and  II  can  be  handled  by  Gauss  or  Newton-Cotes  formulas 
with  interval-valued  nodes,  but  this  leads  to  wide  interval  bounds.  The  method  of  Taylor 
series,  as  outlined  in  §4,  applies  as  easily  to  the  cases  of  one  or  both  endpoints  interval¬ 
valued  as  to  RR  type  integration.  Hence,  the  program  uses  Taylor  polynomials  for  integrals 
of  types  RI,  IR,  and  11,  even  if  the  user  has  chosen  Gauss  or  Newton-Cotes  formulas  for 
use  on  type  RR  subintervals. 

Using  the  same  notation  as  in  §3,  g{x)  denotes  an  indefinite  integral  (3.4)  of  f{x). 
The  one-panel  form  of  the  various  integration  formulas  are  then: 

Type  RI: 

(6.13) 

Type  IR: 

(6.14) 

Type  II: 

(6.15) 

Each  uses  intersections  of  subsequent  estimates  and  order  adaptation  as  the  RR  algorithm 
does. 


|a,6J  ,  ,  ^ 

/  f(x)dxCg{x)  -h  F^”)([a,6]) 

Jn  a 


(x  -  c)’ 


(«+!)•'  la 


/" 


f{x)dx  C  g(x) 


\a,b]  (n+1)!. 


|a,6] 


|■\a,b 

ia,6j 


jia.ti 


(x-c) 


n+1 


/(x)dx^^(x)|  '  ^  F^”)([a,6])  , 

ila.bj  (n-hlj! 


[a, 6] 

ia,fc] 
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Formulation  of  the  multipanel  forms  for  the  above  types  requires  some  care.  Suppose 
the  nodes  a  ~  xq  <  <  •  •  •  <,  x^  —  h  are  selected,  and  set  K,  =  for  i  = 

1,2,...,/:.  For  type  II  integrals,  let 


(6.16) 


.j-Tn 


x)dx. 


which  is  symmetric  about  0.  If  s./  t  then 


(6.17) 


^  f(z)dl€lyj  cJ2h.f. 


If  s  £  y,  and  t  £  V,  with  i  <  j,  then 


(6.18) 


f[x)dx  -  /  f[x)dx^  /  f{x)dxi- - r  I 

€  r  f{x)dx^  f{x)dx  +  --.^  f 

JYi  JYj 


f[x)dx 


f{x)dx 


Hence, 


(6.19) 


r\a,0{  ^  rV-i 

/  f{x)dx  f(x)dx. 

J[a,h] 


The  subintervals  F,  are  chosen  by  the  same  subinterval  adaptive  strategy  as  used  for  type 
RR  integrals.  Thus,  the  algorithm  for  type  II  integrals  is  the  same  as  for  type  RR,  except 
that  the  integral  on  each  subint  erval  is  computed  using  the  one-panel  type  II  rule,  equation 
(6.15),  with  intersection. 

The  multipanel  rules  for  types  RI  and  IR  do  not  lend  themselves  to  subinterval  adap¬ 
tation.  By  definition, 


(6.20) 


/(x)dx=|J^  f[x)dx  I  ia,6]| 

^[jyj{x)dx\teYi^ 

fYy 

-  /  f{x)di^ 

J  a 

~  r  f{x)dx^  r'  f{x)d 

Ja  Jxi 


f{x)dx 


f{x)dx 


fYk 

/  /(i)dJ 

J  XL  -  I 
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Once  the  endpoints  of  a  subinterval  are  chosen,  that  subinterval  can  be  subdivided  only 
with  great  difficulty.  We  observe  that 


(6.21) 


so  the  type  II  algorithm  gives  bounds  using  adaptation.  If  those  bounds  are  not  small 
enough,  then  we  apply  the  algorithm  given  by  equation  (6,20)  using  a  fixed  stepsize  equal 
to  the  smallest  stepsize  chosen  adaptively  by  the  type  II  algorithm. 

Type  IR  integrals  are  done  similarly. 

In  the  general  case  of  the  integral  (6.3),  the  allowable  tolerances  and  the  maximum 
number  of  function  evaluations  must  be  apportioned  among  three  distinct,  independent 
integrations.  These  are  given  in  proportion  to  the  width  of  the  respective  subintervals.  If 
r(A'',)  denotes  the  tolerance  allowed  on  the  subinterval  A'i,  t*  is  the  tolerance  reimaining, 
then 


(6.21) 


T{Xi)  =  T* 


w{X,) 

w{X)  ' 


If  the  integral  on  the  first  subinterval  is  narrower  than  its  share  of  the  total  tolerance 
requires,  then  the  tolerances  on  the  other  subintervals  are  relaxed  so  that  the  total  tolerance 
can  be  met  more  efficiently.  On  the  other  hand,  if  the  integral  on  the  first  subinterval 
is  a  little  too  wide,  then  the  integrals  on  the  remaining  subintervals  can  sometimes  be 
commuted  accurately  enough  that  the  total  tolerance  can  still  be  satisfied. 

7.  Implementation  Details.  It  follows  from  the  above  discussion  that  realization  of 
adaptive,  self-validating  quadrature  routines  on  a  computer  requires  that  the  following 
features  be  supported: 

(i)  Interval  computation  (arithmetic  operations  and  standard  library  functions). 

(ii)  Subroutines  for  automatic  generation  of  Taylor  coefficients. 

(iii)  Accurate  scalar  product  of  interval  vectors  (to  minimize  width  due  to  roundoff 
error). 

The  program  INTE  [9]  was  written  in  FORTRAN  for  the  Sperry  1100,  on  which 
only  (i)  125!  and  (ii)  were  supported.  As  mentioned  above,  INTE  performs  self-validating 
numerical  integration  without  adaptation.  The  microcomputer  language  Pascal-SC  [22] 
and  the  ACRITH  package  for  the  IBM  370  series  of  computers  support  (i)  and  (iii),  to  which 
the  routines  (ii)  have  been  added.  The  implementation  described  in  [6]  for  Pascal-SC  is 
immediately  adaptable  to  ACRITH.  The  program  described  here  is  written  in  FORTRAN 
for  use  with  ACRITH.  In  particular,  the  following  operators  and  library  functions  are 
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supported: 


-t- 

SIN 

SINK 

EXP 

— 

COS 

COSH 

LOG-LN 

*■ 

TAN 

TANH 

LOGIO 

/ 

COTAN-COT 

COTANH=COTH 

ERF 

■t*  constant 

A  SIN 

ASINH 

ERFC 

ABS 

ACOS 

A  COSH 

GAMMA 

SQR 

ATAN 

ATANH 

LGAMMA 

SQRT 

ACOTAN-ACOT 

ACOTNH-ACOTH 

The  Taylor  series  terms  Fn[C,H)  and  are  calculated  by  using  the  well- 

known  recurrence  relations  'l5j,  [161,  [19l  for  the  operators  and  functions  given  above.  In 
this  application,  the  “point”  of  expansion  and  the  stepsize  are  interval-valued  to  give  the 
desired  inclusions,  but  the  recurrence  relations  remain  the  same.  It  is  possible  for  the  user 
to  augment  the  list  of  library  functions  given  above  if  the  required  recurrence  relation  for 
Taylor  coefficients  of  the  new  function  is  known. 

Given  an  integrand  of  the  type  considered,  for  example, 

(7-1)  /(-)  =  rh' 

the  program  first  parses  it  into  a  code  list  [l9j: 


Operator 

Operand  1 

Operand  2 

Result 

SQR 

X  (—Tempi) 

Temp4 

-i- 

1  (  — Temp2) 

Temp4 

Temp5 

/ 

4  (  —  Temps) 

Temp5 

F 

In  order  to  compute  the  series  for  /  expanded  at  C  with  stepsize  H,  this  code  list  is 
interpreted  to  obtain  the  sequence  of  calls 

Tempi  :=  {C,H)  {The  series  for  x.} 

Temp2  (1)  {Constant  series.} 

Temp3  :=  (4)  {Constant  series.} 

Call  ITSQR(Templ,Ternp4) 

Call  ITADD(Temp2,Temp4,Temp5) 

Call  ITDIV(Temp3,Temp5,F). 

The  result  is  an  array  which  contains  the  interval-valued  series  for  /  and  an  indication  of 
how  many  terms  were  computed.  For  example,  the  subroutine  ITSQR(t/,  F)  computes  the 
series  for  V'  =  given  the  series  for  U,  by  means  of  the  recurrence  relations 
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where  V,  =  V,{C.H)  and  V,  -  Ui[C,H)  denote  the  tth  Taylor  coefficents  of  V  and  V\ 
respectively  !l9],  p.  49.  The  interval  function  ISQR  computes  ISQR(X)  =  instead  of 
X  ■  A',  which  is  preferable  in  general,  since,  for  example.  =  iO,  1]  while  |-l,lj  • 

;  — 1,  Ij  =  |  —  1.  li.  The  parsing  and  interpretation  at  runtime  described  above  is  unnecessary 
in  Pascal-SC,  because  the  compiler  generates  the  required  code  |6i. 

The  interval  Taylor  coefficients  Fj  (A,  C),  F^iX.  C).  ...  are  maintained  in  a  record-like 
structure.  If  “F”  is  the  name  of  the  function  being  expanded,  then 


LF  Index  of  last  known  nonzero  term; 

MF  Index  of  last  known  term; 

OFL  Vector  of  series  terms — left  (lower)  bound; 
OFR  Vector  of  series  terms — right  (upper)  bound. 


The  designations  for  other  variables  replace  “F”  in  the  above. 

The  algorithms  used  depend  on  whether  the  endpoints  of  the  interval  X  of  integration 
are  elements  of  S,  that  is,  machine  numbers,  or  whether  they  are  intervals.  The  basic 
algorithm  is  for  the  case  X  =  [a, 6]  ^  IS,  and  is  called  the  RR  (REAL-REAL)  algorithm; 

1.  Compute  the  integral  on  [a,  6]; 

2.  Add  [a,  b]  to  the  list  of  subintervals; 

3.  Loop 

4.  Find  the  subinterval  on  which  the  width  of  the  integral  is  largest; 

5.  Bisect  it; 

6.  Compute  the  integral  on  the  left  subinterval; 

7.  Add  the  left  subinterval  to  the  list; 

8.  Compute  the  integral  on  the  right  subinter vai; 

9.  Add  the  right  sxibinterval  to  the  list; 

10.  Compute  the  integral  on  [0,6]  by  summing  the  integrals  on  all  the  subintervals; 

11.  Exit  when  accuracy  tolerance  is  met; 

12.  Exit  with  warning  when 

13.  no  further  improvement  in  accuracy  is  possible, 

14.  or  A/  function  evaluations  are  exceeded; 

15.  End  loop. 
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Each  subinterval  X  is  maintained  in  a  data  structure  of  the  following  form; 


XA 

XB 

OPTORD 

WIDINT 

SINT 

WEGHT 


FNVAL 


FNTRN 


Left  endpoint  of  the  subinterval: 

Right  endpoint  of  the  subinterval: 

Order  of  the  derivative  used  to  compute  the  remainder; 

Width  of  the  integral  on  this  subinterval; 

Interval-valued  integral  on  this  subinterval; 

Vector  of  interval  valued 

weights  (Gauss  and  Newton-Cotes), 
stepsize  (Taylor); 

Vector  of  interval-valued 

function  values  (Gauss  and  Newton-Cotes), 
series  terms  (Taylor); 

Vector  of  interval-valued 

function  values  (Gauss  and  Newton-Cotes), 
series  terms  (Taylor), 

including  remainder  terms. 


At  steps  1,  6,  and  8,  the  integral  is  computed  on  the  subinterval  [XA,XB]  using  one- 
panel  versions  of  Gauss,  Newton-Cotes,  or  Taylor  polynomials  as  outlined  in  Sections  2  and 
3.  The  weight  vectors  and  functions  are  arranged  in  the  vectors  WEGHT  and  FNTRN, 
respectively,  in  such  a  way  that  the  interval  inclusion 

(7.3)  =  Z!  WEGHT,  *  FNTRN^ 


of  f{x)dx  is  computed  as  a  single  inner  product.  Similarly, 

(7.4)  =  ^^’EGHT,  *  FNVALi. 

i 

At  step  13,  J  C  Bf,  then  the  loop  is  exited.  In  this  case,  further  reduction  of  the  width 
of  the  truncation  error  cannot  reduce  w{J).  For  Gauss  and  Newton-Cotes  formulas,  SINT 
is  used  only  to  give  WIDINT.  For  Taylor  polynomials,  SINT,  is  intersected  with  (7.3). 
SINT,  is  computed  using  the  intersection  principle  discussed  at  the  end  of  §3.  For  a  few 
subintervals,  ^^  SINT,  is  narrower  than  (7.3),  while  the  situation  is  usually  reversed  for  a 
large  number  of  subintervals,  because  (7.3)  uses  the  accurate  scalar  product. 

The  arrangement  of  subintervals  in  the  arrays  listed  above  must  be  relatively  straight¬ 
forward  to  allow  J  to  be  computed  by  a  single  scalar  product  operation.  Each  iteration  of 
the  loop  from  step  3  to  step  15  removes  one  subinterval  from  the  list  and  replaces  it  with 
two  subintervals.  Subintervals  are  not  otherwise  deleted  from  the  list.  Hence,  the  follow¬ 
ing  simple  allocation  scheme  works:  On  the  fth  pass  through  the  loop,  the  information 
about  the  left  subinterval  is  stored  in  the  locations  previously  used  by  its  parent,  and  the 
information  about  the  right  subinterval  is  stored  in  the  (f  +  l)st  locations,  following  the 
already  computed  values.  Hence,  insertion  requires  no  searching.  The  widest  subinterval 
is  found  at  step  4  by  a  sequential  search  of  the  array  WIDINT(l..i). 
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By  contrast.  QLADPACK  jl7i  maintains  its  list  of  pending  subintervals  in  sorted 
order,  so  no  search  is  necessary  for  the  next  subinlerval  to  be  processed.  However,  new 
subintervals  are  inserted  at  locations  found  by  a  sequential  search,  followed  by  changing 
pointers  to  all  following  entries  in  the  list.  For  each  subinterval  processed,  the  program 
does  one  sequential  search,  while  QUADPACK  does  two.  In  addition,  QUADPACK  uses 
two  sets  of  pointer  adjustments. 

For  integration  using  Taylor  polynomials,  the  maintenance  of  list  of  subintervals  is 
somewhat  more  complicated,  because  the  program  reuses  the  series  which  it  has  previ¬ 
ously  computed.  To  illustrate  the  ideas,  consider  the  first  execution  of  the  loop  at  step  3. 
At  that  point,  the  list  of  subintervals  contains  only  one:  X  =  la,  6]  itself.  FNTRN  contains 
OPTORD~l  terms  of  the  series  for  /  expanded  at  c  =  (a  +  b)/2  and  the  truncation  error 
term  involving  /r(oPTORD)^j^j  equation  (4.6).  Provided  that  the  requested  tol-  - 

erance  exceeds  the  noise  inherent  in  the  function  evaluation,  a  stepsize  h  can  be  computed 
which  is  small  enough  that  the  requested  tolerance  per  unit  step  is  satisfied  on  the  interval 
\c  —  h,c  -i-  A,.  Notice  that  if  a  relative  tolerance  is  requested,  then  this  requires  a  current 
estimate  for  J .  The  value  of  the  integral  on  this  subinterval  can  be  computed  at  a  cost 
proportional  to  OPTORD,  instead  of  a  cost  proportional  to  OPTORD^,  which  would  be 
required  to  generate  J  directly  by  using  the  recurrence  relations  for  /.  Following  this,  the 
two  subintervals  \a,c  -  h]  and  [c  -j-  A,  6]  are  processed  directly.  Consequently,  this  method 
breaks  the  subinterval  of  integration  into  three  parts,  rather  than  bisecting  it. 

Thus,  for  integration  by  Taylor  polynomials,  step  5  of  the  RR  algorithm  is  replaced 
by 

5.0'  Compute  A  such  that  the  tolerance  is  satisfied  on  jc  —  A,c  -j  A]; 

5.1'  Compute  the  integral  on  [c  —  h,c  +  A]  from  information  in  FNVAL; 

5.2'  “left  subinterval”  ;=  [XA.  c  —  Aj; 

5.3'  “right  subinterval”  :=  [c  -|-  A,  XB], 

The  middle  subinterval  [c  —  A,c  -f  A]  is  maintained  on  the  list  of  subintervals  so  that 
its  contribution  to  J  is  included  in  the  scalar  product  in  (7.3).  This  has  the  helpful  side 
■  effect  that  A  can  be  chosen  somewhat  optimistically.  If  the  choice  is  too  optimistic,  then 
[c  —  A,  c  +  Aj  w'ill  be  selected  later  for  further  processing  as  the  worst  subinterval,  at  which 
time  it  will  be  broken  into  three  parts.  One  of  the  new'  subintervals  will  occupy  the  place 
of  the  parent  interval  in  the  list  maintained,  while  the  other  two  will  be  added  to  the  end 
of  the  list. 

A  further  refinement  could  be  implemented.  The  stepsize  A  computed  at  step  5.0'  has 
the  following  property:  On  each  subinterval  of  length  2A  which  is  contained  in  [XA,XB], 
the  use  of  a  Taylor  polynomial  of  degree  OPTORD— 1  yields  an  integral  which  satisfies  the 
requested  tolerance.  The  integration  on  such  a  subinterval  can  be  done  with  half  the  usual 
work.  No  series  for  the  truncation  error  needs  to  be  computed  because  the  truncation 
error  can  be  bounded  by  using  the  global  remainder  term  on  [XA,XB].  If  [XA,XB]  can 
be  covered  by  a  few  subintervals  of  length  2A,  then  this  could  be  done,  and  division 
into  three  parts  would  be  needed  only  when  the  middle  part  is  relatively  small.  This 
refinement  could  improve  the  efficiency  of  the  program.  However,  the  improvement  would 
likely  be  modest,  because  the  advantages  of  intersecting  subsequent  estimates  on  each 
small  subinterval  would  be  lost,  and  the  number  of  subintervals  added  to  the  list  would 
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no  longer  be  constant.  The  program  also  does  not  reuse  function  evaluations  required  by 
Gauss  or  Newton-Cotes  formulas,  although  it  could  be  modified  to  do  so. 

8.  Numerical  Examples.  We  give  four  examples  to  illustrate  the  accuracy  and  the 
reliability  of  the  program.  All  computations  were  done  in  double  precision  on  an  IBM 
4341  computer,  using  calls  to  ACRITH  routines  for  all  necessary  interval  calculations, 
including  scalar  products. 

fO.7  j 

Example  1.  /  =  /  - dx. 

io.e  l-i 

Interval  bounds  for  the  answer  can  be  computed  in  three  ways: 

(8.1)  I  =  log(l  -  0.6)  ~  log(l  -  0.7), 


(8.2) 


I  =  log(4/3). 


or  by  adaptive,  self-validating  quadrature. 

Neither  0.6  nor  0.7  are  machine  numbers,  so  they  are  converted  to  intervals  which 
are  one  machine  number  wide.  All  three  methods  give  the  interval  [0.2876820724517808, 
0.2876820724517811],  but  the  results  are  20,  13,  and  14  machine  numbers  wide,  respec¬ 
tively.  That  is,  the  program  is  capable  of  accuracies  comparable  to  evaluation  of  the 
analytic  expression  for  the  answer.  This  result  required  43  equivalent  function  evaluations 
on  18  subintervals.  In  order  to  appreciate  the  accuracy  achieved  by  the  program,  we  must 
consider  some  details  at  the  level  of  machine  numbers.  If  (8.1)  is  evaluated  without  sim¬ 
plification  using  interval  calculations,  the  result  is  the  hexadecimal  interval  |Z  4049  A588 
44D3  6E41,  Z  4049  A588  44D3  6E55],  which  is  20  machine  numbers  wide.  Using  (8.2),  the 
last  4  hexadecimal  digits  are  [Z  ...  6E45,  Z  ...  6E52j,  which  is  13  machine  numbers  wide. 
The  adaptive,  self-validating  quadrature  program  gives  jZ  ...  6E44,  Z  ...  6E52],  which  is 
14  machine  numbers  wide. 


Example  2 


■r 


16  /  .  , 
\' xdx  =  —  (see  §5.1). 

O 


This  example  illustrates  the  order  adaptation  required  to  handle  the  nonexistance  of 
/'(O).  The  program  gave  the  interval  [5.33333  33333  27,  5.33333  33333  36].  It  stopped  when 
it  had  used  400  effective  function  evaluations  on  226  subintervals.  Most  of  the  subintervals 
were  clustered  near  the  origin.  Away  from  0,  as  many  as  13  series  terms  were  used. 


Example  3. 


X.  where 


f{^) 


0,  X  <  0.3, 
1  X  >  0.3. 


This  example  illustrates  integration  of  a  simple  discontinuous  function.  The  parser 
does  not  accept  a  piece- wise  definition,  so  this  function  was  coded  by  hand.  Table  1  shows 
the  results  for  tolerances  0.0  and  l.OE-15.  and  for  recognizing  that  the  function  has  a 
singularity  on  the  interval  of  integration. 
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Width  of 
Integral 

Function 

Evaluations 

Subintervals 

Tolerance  =  0.0 

jo,  1] 

2.50E-16 

218 

146 

jO.  0.3;  -T  0.3.  Ij 

2.78E-17 

22 

29 

Tolerance  =  l.OE-15 

[0,  1! 

7.77E-16 

194 

130 

jo.  0.3]  +  10.3,  1] 

1.39E-16 

10 

5 

Table  1.  Integrating  a  Discontinuous  Function. 

As  is  usually  the  case,  the  program  performs  much  better  when  the  user  recognizes 
the  presence  of  a  discontinuity.  The  performance  would  be  even  better  if  the  discontinuity 
occured  at  a  machine  number.  Notice  that  the  cost  of  requesting  the  program  to  do  the 
best  it  can  (tolerance  =  0.0)  is  only  slightly  more  than  the  cost  when  a  lesser  tolerance 
is  prescribed.  The  table  shows  more  subintervals  than  evaluations  because  evaluating  a 
series  which  is  =  0  is  not  counted  as  an  evaluation. 

f  1 

Example  4.  / - ^dx. 

This  example  illustrates  the  performance  of  the  program  as  the  integrand  varies  from 
very  smooth  to  being  undefined  at  a  point  in  the  interval  of  integration.  These  calculations 
were  done  in  single  precision  with  an  absolute  error  tolerance  request  ofT.OE-5.  An  error 
code  of  66  signals  that  the  program  was  unable  to  meet  the  requested  tolerance,  while  67 
means  that  it  was  unable  to  evaluate  the  integrand.  As  the  problem  begame  more  difficult, 
the  program  required  more  effective  function  evaluations  and  more  subintervals. 
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Error 

Function 

Maximum 

Absolute 

Alpha 

Code 

Evaluations 

Subintervals 

Order 

Error 

0.00 

0 

2 

2 

3 

0.0 

0.05 

0 

8 

2 

20 

4.4E-16 

0.10 

0 

8 

2 

20 

1.8E-14 

0.15 

0 

8 

2 

20 

6.6E-12 

0.20 

0 

8 

2 

20 

7.8E-10 

0.25 

0 

5 

2 

15 

4.1E-06 

0.30 

0 

15 

6 

15 

1.9E-10 

0.35 

0 

12 

6 

13 

4.5E-08 

0.40 

0 

12 

6 

13 

1.8E-07 

0.45 

0 

21 

10 

15 

6.1E-08 

0.50 

0 

19 

10 

14 

1.4E-07 

0.55 

0 

16 

10 

12 

7.9E-06 

0.60 

0 

24 

14 

12 

2.4E-07 

0.65 

0 

35 

18 

15 

6.4E-08 

0.70 

0 

29 

18 

12 

5.0E-06 

0.75 

0 

38 

22 

12 

1.3E-07 

0.80 

0 

48 

26 

15 

6.6E-08 

0.85 

0 

43 

26 

12 

7.5E-06 

0.90 

0 

62 

34 

15 

6.8E-08 

0.95 

0 

73 

42 

14 

1.2E-07 

1.00- 

66 

254 

150 

8 

3.2E-01 

1.05 

67 

2 

Table  2.  Performance  Profile. 
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ABSTRACT 


We  report  about  the  computational  aspects  of  high  level  algorithms 
developed  for  efficiently  processing  the  diverging  and  converging  branch 
systems  in  nonserial  dynamic  programming.  A  special  feature  of  these 
algorithms  consists  of  a  special  technique  devised  for  processing  the 
network  functions  such  that  the  minimum  amount  of  storage  is  employed.  It 
is  shown  that  if  k  is  the  discretization  level  of  the  state  and  decision 
variables  then  the  space  complexities  are  0(k)  and  0(k  )  for  the  diverging 
and  converging  branch  systems  respectively.  The  resultant  time 
complexities  are  also  developed.  These  savings  in  computational 
complexities  enhance  the  attractiveness  of  dynamic  programming  as  a  tool 
for  processing  more  complex  nonserial  systems. 


i.  INTRODUCTION 


When  considering  nonserial  dynamic  programming  networks  [12] 
attention  is  usually  focussed  initially  on  the  following  four  basic  well 
structured  systems:  diverging,  converging,  feed  forward  and  feedback  loop 
systems,  A  characteristic  of  each  of  these  nonserial  systems,  is  the  fact 
that  at  least  one  subsystem  either  receives  inputs  from  more  than  one 
subsystem  or  sends  outputs  to  more  than  one  subsystem.  Alternatively,  for 
at  least  one  of  the  stages  of  these  systems,  the  output  is  not  the  input 
to  the  next;  thus,  there  exists  at  least  one  n  such  that  the  output  t 
^n-1*  input  of  the  next  stage.  This  distinguishes  them  from  the  usual 
serial  systems. 

Examples  of  the  above  classical  nonserial  systems  and  various 
combinations  of  them  abound  in  real  life  ([2]  [13]  [18]),  For  example, 
they  are  encountered  in  the  study  of  chemical  processing  systems,  natural 
gas  transmission  pipelines,  water  resources  systems,  energy,  communication 
and  computer  networks.  There  are  important  reasons  to  treat  these 
problems  from  the  standpoint  of  nonserial  dynamic  programming.  In 
general,  however,  the  resultant  problems  are  more  difficult  computation^ 
ally  than  classical  serial  dynamic  programming.  It  is  clear  that 
computational  advances  in  serial  dynamic  programming  play  an  important 
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role  in  the  study  of  nonserial  dynamic  programming  processes.  As  an 
extension  of  this  argument,  analysis  of  complex  nonserial  dynamic  program¬ 
ming  processes  consisting  of  various  combinations  of  each  of  the  basic 
structures  is  also  aided  by  computational  advances  in  the  four  classical 
structured  nonserial  dynamic  programming  systems  outlined  earlier. 

Hitherto,  limited  attention  has  been  paid  to  efficient  algorithms  for 
treating  nonserial  dynamic  programming  networks.  In  particular  there  is  a 
complete  absence  of  any  discussions  relative  to  their  computational 
complexities.  This  shortcoming  was  recently  addressed  in  [14].  The 
emphasis  was  on  the  development  of  efficient  computing  algorithms  which 
will  minimize  the  usual  storage  requirements  of  dynamic  programming 
encountered  while  processing  the  first  two  classical  nonserial  systems, 
namely  the  diverging  and  converging  branch  systems.  In  this  paper  we 
merely  outline  the  problems  and  concentrate  on  the  complexity  analysis  of 
the  resultant  algorithms.  The  reader  is  referred  to  the  above  referenced 
paper  for  algorithmic  details. 

2.1  THE  BASIC  DYNAMIC  PROGRAMMING  ALGORITHM  FOR  THE  DIVERGING  BRANCH 

SYSTEM 


A  diverging  branch  system  (see  Fig.  1)  is  the  easiest  of  the 
elementary  nonsertal  structures  to  analyze.  For  simplicity,  we  first 
consider  a  two  branch  system.  The  stage  transformations  t(.,.)  and  return 
functions  r(.,.)  both  for  a  main  serial  process  i  (i=l,2,  n)  and  for 

a  branch  j  (J  “  1,2,  ...,  m)  are  defined  as  follows: 


“i-l 

=  t^  (x^,  d^) 

,  i  —  1, 

2, 

•  •  •  y  n 

.  j  =  1, 

2, 

•  •  •  y  m 

ml 

i 

=  ^  <*1.  ■ip 

,  i  =  1, 

2, 

•  •  •  ^  n 

j.l 

.  j  =  1. 

2, 

•  *  •  ^  fU 

Without  loss  of  generality,  consider  the  basic  system  consisting 
of  one  main  serial  system  (i  =  I,  2,  ...,  n)  and  one  branch  (j  =  1,  2, 
...,  m).  Let  us  assume  that  the  input  and  decision  variables  at  each 
stage  have  the  following  integer  values: 


1 

1 

1 

1 


< 

< 


X,,  <  k.,  , 

Jl  Jl 


dji  <  Pjj^  , 

di  C  pi  , 


J  lj2j 

1  ^  Ij  ^ 

j  ^  2j  ni 

i.  If  2}  ti 
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The  algorithm  Is  developed  by  first  decomposing  the  network  into  four 
phases  and  then  employing  the  usual  recursive  procedures  in  optimizing  the 
total  return.  The  recursion  equations  for  the  various  phases  then  become: 

2.1.1)  For  the  Diverging  Branch  (from  stage  11  to  stage  ml) 


fll(Xii)  =  max  d^^j^) 

1  ‘  '’u  ‘  '■ll 


1  <  djl  <  Pji 


where  j  =  2,  3,  . . . , 

Using  the  above  equations,  the  optimal  branch  return  and  optimal 
decisions  are  computed  for  each  possible  value  of 

2.1.2)  For  the  Main  Serial  Process  (from  stage  1  to  stage  s-1,  i.e. 
prior  to  junction  node) 


ffCxi)  =  max  dj^) 

1  <  dj^  <  pj^ 

fl(Xi)  “  max  tr^(Xj^,  d^)  + 

1  <  dj^  <  P^ 

where  1=2,  ...»  s~l 

The  optimal  return  f^Cx^^)  and  optimal  decision  d^  at  each  stage  are  saved 
for  each  possible  input  value  Xj^. 

2.1.3)  For  the  Stage  s  (junction) 


^s+ml(*s>  =  ‘^s^  ^s-l^^'s^^s’  ‘^s^  ^ml^’^sl^^'s ♦ 


1  <  dg  <  p 


At  this  stage,  the  optimal  return  combination  of 

main  serial  process  preceding  stage  s,  ^sl^^  optimal 

return  from  the  branch,  possible  value  x^, 

both  f  .  i(x„)  and  d„  are  reserved  at  this  stage, 
s+ml  s  s 
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2.1.4)  For  the  Remaining  Stages  (from  stage  s  +  1  to  stage  n,  the 
terminal  node) 

The  optimal  return  at  each  remaining  stage  from  s  +  1  to  n  can  be 
obtained  as  in  the  usual  serial  systems,  i.e.. 


W(*n>  “  f/n<*n*  ^^n^  +  ^n-l+ml^^n^^n*  ‘^n^l 


I  <  d  <  p 


n 


where  l=s+l,  ...,n 

2.1,5)  Determination  of  the  Optimal  Decision  and  Return  at  Each  Stage 

At  the  final  stage  n,  the  optimal  input  to  the  system  can  be 
obtained  by  letting 


1  <  X  <  k 
n  n 


With  the  optimal  input  x  and  optimal  d  obtained  from  a  decision  table* 

^  ^  * 
we  can  produce  optimal  stage  return  and  optimal  stage  output  as 

follows: 


ic 

r 

n 


Cl  = 

This  process  continues  from  stage  n  down  to  stage  s  +  1.  ^ 

At  the  junction  stage  (stage  s),  the  opj^imal  stage  input  Xg  and  stage 
decision  d* ,  the  optimal  branch  input  are  obtained  via  the  transition 
equation 


%1  =  Sl^S*  ‘^s^- 

For  the  remaining  processes,  the  stage  transformation,  return  function, 
and  decision  tables  can  be  used  at  each  stage. 

2.2  A  High  Level  Computing  Algorithm  for  Diverging  Branch  Systems 
(DBCA) 


The  conventional  computer  algorithm  for  performing  the  operations 
listed  in  the  foregoing  generally  employs  a  brute^force  method  to  store 
optimal  decisions  aj  each  stage  and  then  later  retracing  them  after  the 
optimal  return  fj|(Xj^)  has  been  found.  Such  an  approach  dictates  an 
enormous  amount  of  storage  requirement.  The  problem  obviously  gets  worse 
when  large  and  complex  networks  are  involved.  This  is  certainly  the  case 
when  multi  state  variable  dynamic  programming  problems  as  in  converging 
and  loop  systems  are  Involved.  To  mitigate  this  problem,  we  have 
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A 

developed  a  technique  which  marks  the  optimal  decision  value,  say,  by 
adding  kj  -  1  +  k  to  the  state  entry  t^(xj^,  d^)  as  the  processing  of  each 
stage  takes  place.  In  other  words,  k^  is  a  number  larger  than  all 
discretization  levels  and  k  5=  max  (kj^^^,...,  *  *  •  ^'^n^  •  This 

eliminates  the  neejj  for  storing  the  optimal  decision.  Later,  when  the 
optimal  decision  d^  Is  to  be  r^trajed,  it  may  be  retrieved  by  searching 
only  the  Xj^»row  of  for  t^(x^,  d^)  >  k^  over  values  of  d^.  This  is  done 
for  each  i.  Any  future  reference  to  the  table  t^  is  then  made  as  (each 
entry)  mod  k^. 

This  idea  proposed  for  the  elimination  of  the  need  for  storing 
optimal  decisions  can  result  in  a  substantial  saving  of  storage  space. 

For  example,  consider  a  simple  serial  dynamic  programming  system  involving 
S  stages,  D  decision  variables  and  P  state  variables*  If  K  Is  the 
discretization  level  of  the  state  variable,  then  using  the  conventional 
computational  procedure,  the  storage  requirement  for  the  optimal  decision 
is  approximately  SDK^.  In  a  very  simple  case  where  S-10,  D-1,  K= 

1000  and  P  -  2,  this  saving  amounts  to  10^. 

In  a  diverging  branch  system,  tables  F(0;1,1:K  )  are  needed  for 
processing  the  optimal  return  at  each  stage.  At  each  stage  i,  the  optimal 
return  is  processed  by  using  the  previous  stage  optimal  return  from 
F(st,.),  and  stored  in  F(dt,.),  where  st  =  (i^l)  mod  2,  and  dt  =  i  mod  2. 
The  optimal  return  stored  in  table  FM(1:K  ).  Note  that  st  and  dt 

are  used  to  indicate  indices  of  source  and  destination  tables 
respectively. 

To  present  the  algorithm,  we  first  explain  the  notations.  The 
algorithms  are  described  in  a  PASCAL-type  construct.  Comments  are 
enclosed  within  (*...*)•  Enclosure  of  a  simple  or  a  complex  statement 
within  a  loop  is  effected  by  indenting  the  statement.  In  [14]  the 
computer  algorithm  for  the  diverging  branch  system  is  described  in  detail. 
It  consists  of  modules  A  through  G  each  corresponding  to  the  seven 
different  phases  of  optimization. 

3.1  THE  BASIC  DP  ALGORITHM  FOR  THE  CONVERGING  BRANCH  SYSTEM 


Let  us  similarly  review  the  converse  of  the  diverging  branch  system, 
namely  a  converging  branch  nonserial  network.  In  its  simplest  form,  a 
number  of  parallel  serial  systems  join  together  at  a  junction  node  and 
then  feed  their  outputs  to  a  serial  system.  A  simple  example  consisting 
of  two  input  parallel  branches  and  one  serial  output  is  exhibited  in  Fig. 

2  and  is  used  as  the  leitmotif  for  our  algorithm.  In  general,  the 
converging  branch  system  is  treated  as  an  initial  final  value  problem 
(often  termed  a  two-point  boundary  value  problem)  resulting  in  a  two 
dimensional  optimization  problem. 

We  begin  by  noting  that  this  system  is  more  difficult  computationally 
than  the  diverging  branch  system.  However,  as  before,  we  may  still  view 
it  as  consisting  basically  of  a  serial  system  i,  i=  1,  2,  **.,  n  and  a 
branch  j,  j  =  1,  2,  ...,  m  with  convergence  occurring  at  node  (stage)  s. 
The  transformation  at  this  stage  may  be  written  as: 
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*s-l  = 

The  transition  function  for  the  other  stages  may  be  represented  as  in 
the  usual  serial  processes  as  follows: 

For  the  Branches 

,  j  =  1,  2,  m 

For  the  Main 


X  , 

J 


'1 


t .(x. ,  d. ) 
J  J  J 


>  1,  •••>  n;  j 


We  define  the  returns  for  each  stage  similarly.  Thus, 


r 

s 


r  ,  X  ,  d  ) 

s  01'  s'  s 


r . 
1 


dj) 


,  i=l,  2, 


=  r.,(x.-,  d..) 
jl  jl'  jl’  ]!' 


»  J  *“  •••>  ® 


To  develop  the  algorithm,  we  proceed  as  follows:  We  first 
decompose  the  system  into  three  components  corresponding  to  stages  11,  21 
to  ml,  and  1  to  n.  For  stages  1  to  n,  we  separately  consider  stages  1  to 
s-l,  s,  and  finally  s  +  1  to  n.  To  find  the  optimal  branch  return 
fmiCxQ]^)  we  use  the  backward  recursion.  Next,  we  maximize  over 

^ml’  recursion  equations  for  the  different  phases  may  then  be 

defined  as  follows: 


3.1.1)  FOR  STAGE  11 


We  solve  the  problem 


^  ^  ^11  ^  P  11 


s.t.  Xqj^  -  dj^j^) 


In  other  words,  for  each  input  value  x^^^  we  will 
decision  d-^-^  which  satisfies  -  txi^^ll*  ^11^ 
stage  return.  For  each  value  of  Xq^^),  the 


find  the  optimal 

and  also  maximizes . the 

optimal  decision  dj^j^  and 


790 


optimal  return  r^.  are  saved.  We  note  that  a  two  state  variable  dynamic 
program  results  here  as  well  as  In  the  next  phase  of  the  model* 

3.1*2)  For  Stages  21  to  ml 

The  optimal  return  Is  given  by 


j  ^  2ji  •••}  Tn 


1  <  djj  <  p.j 


At  each  |tage  from  21  to  ml,  the  optimal  decision  d. and  optimal 
return  are  computed  for  each  pair  of  *oi^* 

^ml^*ml*  *01^  found  and  the  value  of  which  maximizes  the  branch 
return  for  each  value  of  x^^  is  obtained  such  that* 


’‘ll> 


ml 


3*1*3)  For  the  Main  Serial  Process 


3.1*3-i  The  optimal  return  from  stages  1  to  s  -  1  can  be  found  by 
using  the  usual  recursive  procedure,  i.e*, 


fl(Xi)  =  max  dj^) 

1  <  d^  < 


fl(Xl)  =  inax  [r^(x^,  d^)  +  d^))] 

1  <  dj^  <  Pj^ 

where  1=2,3,  .,,,s-l 


3.1*3-ii)  At  Stage  s  (the  junction  node) 

The  optimal  branch  return  f^T  (x^i »  Xqj^)  is  combined  with  the  return 
at  stage  s  and  the  optimal  return  from  stages  1  through  s  -  1  using  the 
recursion  equation 


■_(x  , 
s  s 


x  , )  =  max  [r  (x 


ml 


s'  01’  s’ 


*01 


where  the  maximization  is  over  1  <  ^oi^  *^01  ^  ^  ^  ^s*  other 

words,  at  junction  s,  we  compute  the  optimal  return  fg(Xg)  and  determine 
optimal  branch  output  and  optimal  decision  dg,  for  each  input  value 

of  Xg*  We  can  also  obtain  the  optimal  branch  input  which  maximizes 
the  branch  return  using  the  value  of  Xqj*  The  use  of  the  decomposition 
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principle  at  the  junction  stage  s  ensures  that  the  optimization  of  the 
main  serial  chain  is  reduced  to  a  sequence  of  one  dimensional  problems. 

3.1.3-iii  For  Stage  s  +  1  to  n 

The  recursion  equation  is  given  by 


^iC^i)  =  max  [r^(x^,  d^)  +  f^_^(t^(x^,  d^))] 


where 


1  <  d.  <  p 
1  ^n 


1  s  ly*..i  n 


At  the  final  stage  n  the  optimal  system  return  for  each  input  value  of 
can  be  obtained. 

3.1.4)  Determination  of  the  Optimal  Decision  and  Return  at  Each  Stage 

At  the  final  stage,  the  optimal  input  x*  can  be  obtained  which 

maximizes  f  (x  )  with  the  optimal  decision  d  obtained  from  the  decision 
n  n  n 

table.  We  will  proceed  from  stage  n  -  1  to  stage  s  +  1.  At  stage  s, 

*  * 

the  optimal  input  x^  and  optimal  branch  input  x^j^  are  found  as 
follows: 


* 

X 

s 


s+1 


*  * 

Now  that  Xg  has  been  found,  the  optimal  branch  input  Xq^  can  be 

obtained.  This  is  because  we  decided  optimal  Xqj^  for  each  value  of  x^ 
when  evaluating  the  optimal  objective  function  value  at  stage  s.  For  the 
remaining  stages  the  optimal  stage  input  and  decision  can  be  obtained 
using,  the  stage  transformation  function: 


X  =  t  .i(x  . , ,  d  n  =  s  -  1,  s  “  2, 

n  n+1  n+l*  n+1  *  *  * 


1 


and  the  decision  table,  respectively. 

3.1.5)  Input  Data  Required  for  the  Algorithm 

The  algorithm,  akin  to  that  developed  for  the  diverging  branch 
system,  is  designed  to  receive  the  following  input  specifications  in 
Pascal: 
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n  =  the  number  of  stages  In  the  main  serial  process 

m  the  number  of  stages  in  the  converging  branch 

s  -  junction  stage 

=  upperbound  of  the  input  value 
-  upperbound  of  the  input  value  x^, 
pfl  =  upperbound  of  the  decision  value  d^^, 

=  upperbound  of  the  decision  value  d^^, 

3*1.6)  Output  List  of  the  Algorithm 

At  the  completion  of  the  operations,  unless  otherwise  specified,  the 
algorithm  outputs  are  akin  to  those  of  the  diverging  branch  algorithm* 

3*2  A  High  Level  Computing  Algorithm  for  the  Converging  Branch  System 
(CBCA) 

We  follow  the  notations  of  Section  2*2  and  note  that  in  processing 
the  converging  branch  system,  tables  Fl(0:l,l:k,l:k)  and  F(0:l,l;k)  are 
needed  to  store  the  converging  branch  and  the  main  branch  optimal  returns 
respectively.  A  table  FM(l;k)  is  also  needed  to  store  the  optimal  return 
at  stage  ml  for  use  later.  For  the  details  of  the  CBCA  algorithm,  see 
[14]. 


4.  COMPLEXITY  ANALYSIS  OF  THE  DBCA  AND  CBCA  COMPUTING  ALGORITHMS 


A  major  deficiency  of  the  literature  of  dynamic  programming 
algorithms  is  the  almost  complete  absence  and  certainly  inadequate 
treatment  of  their  resultant  complexity  issues.  Complexity  analysis  is  an 
important  component  of  algorithms  especially  when  digital  computation  is 
of  interest  or  a  necessity.  In  this  section,  we  discuss  the  complexity  of 
the  two  high  level  computer  algorithms  for  the  diverging  and  converging 
branch  nonserial  systems  respectively.  Our  analysis  focuses  on  issues 
related  to  space  (storage)  and  computational  (time)  complexities. 

For  storage  complexity,  S,  we  assume  that  each  variable  (computer 
word)  takes  a  unit  storage  space.  We  may  then  calculate  the  demand  for 
the  storage  tables  during  computation  ignoring  all  input  tables  and 
intermediate  variables  created  during  the  course  of  processing. 

For  a  performance  profile  of  an  algorithm,  we  consider  the 
computational  complexity  T  as  a  function  of  basic  operations.  Apparently, 
comparison,  mod  operation,  assignment  and  arithmetic  operations  may  be 
considered  as  such  basic  operations.  However,  observation  reveals  that 
the  total  number  of  all  of  the  foregoing  operations  performed  is  roughly 
proportional  to  the  number  of  comparisons.  Hence,  comparison  constitutes 
the  basic  operation  and  T  of  an  algorithm  is  approximated  by  the  number  of 
comparisons. 

As  an  example  consider  a  list  [cj^,C2 , •  •  •  of  values  where  c^  is  a 

composition  of  q  of  these  basic  operations.  A  simple  routine  to  find  the 
optimum  value  V  will  be: 


793 


J  =  1 
o 

For  j  “  n  to  1  do 

If  V  <  C.,  then  Jq  =  J; 

The  maximum  number  of  comparisons  in  this  routine  is  0(n). 

Similarly,  searching  a  value  V  in  the  list  can  take  a  maximum  of  one 
comparison.  Of  course,  more  efficient  searching  algorithms  are  available 
However,  for  the  purposes  of  our  analysis  we  choose  this  brute-force 
method  to  get  an  estimate  on  the  worst  cases.  We  further  focus  on  the 
comparison  where  each  such  operation  accounts  for  a  unit  time. 

A  simple  procedure  for  finding  the  maximum  of  w  items  taking  0(w) 
time  can  be  constructed,  A  binary  search  taking  log2  w  time  may  be  used 
for  searching  a  list  of  w  items. 

Let  k  be  the  disretizatlon  level  of  the  state- as  well  as  the  decision 
variables  l.e.  kj^j^  =  pj^j^  =  kj^  =  p^  =  k.  Also  let  m  and  n  be  the  dimen¬ 
sion  of  the  stages  In  the  branch  and  main  subsystems  respectively.  We 
state  and  prove  the  following  theorems  which  characterize  our  algorithm. 

Theorem  1. 

For  the  diverging  branch  computer  algorithm,  the  space  complexity 
S(DBS)  =  0(k)  and  the  time  complexity,  T(DBS)  =  0((m+n)k^) 

Proof 

For  S(DBS),  we  reason  as  follows: 

The  DBCA  algorithm  employs  tables  F(0;l,l:k)  and  FM(l:k)  which 
consume  2k  and  k  spaces  respectively.  Thus,  S(DBS)  =  2k  +  k  =  3k  =  0(k), 
For  T(DBS)  we  proceed  by  listing  the  time  for  each  of  the  basic 
computations  (steps). 

Step  Time 

m 

*  ’'iiPii 


B 


s-1 

I  k  p 

i=l 


k  p 
s^s 


D 


n 

I  k  p 

i=s-H 


E  k 

n 

n 

F  y  (1  log  p.) 

1=1  ^ 

m 

G  I  log  P^ 

i=l 
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Hence,  T(DBS)  = 


^  ^  log  P.  +  1  ^  It  P 

l=l  ^  i=l  ^  i^l  ^  ^ 


n 


+  kgPg  +  I  k  P  +  ^  1 

®  ®  i=s+i  ^  ^  i=i 


m  n 

^  ^^*^ilPil  ■•■  1°6  Pi^  ■•■  I  ^^iPi  ■‘^  +  1)  +  \ 


”  2  .  .  ..  r  ..  2 


(k  +  log  k)  +  ^  (k^  +  log  k  +  1)  +  k 

i=l  1=1 


(m  +  n)  k  +  k  +  (m  +  n)  log  k  +  n. 


Let  a  denote  the  right  side.  Then  T(DBS)  <  a.  Moreover, 

^  .  1  +  (_1_)  _i  +  ^.  (_2_)  1 


(ni+n)K 


m+n  k  ^2  'm+n'  ^2 


=  1  +  Aj^  +  A2  +  A^,  with  the  A^s,  1  ==  1,2,3  defined  as 


A  12|JS_.0 

^  k‘‘ 


*3-072 

k  k 


Thus  - 1  and  T(DBS)  =  0(a)  =  0((m+n)k^). 

(m+n)k 


Corollary  1; 

The  time  complexity  of  DBS  is  linearly  dependent  on  the  total  number 
of  stages  in  the  system.  Further,  for  a  fixed  number  of  stages,  T(DBS)  = 
OCk^). 
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Corollary  2; 


The  computational  complexity  of  DBCA  is  independent  of  the  stage 
where  the  diverging  branch  starts. 

Theorem  2: 

For  the  converging  branch  computer  algorithm,  the  space  complexity  S(CBS) 
-  O(k^)  while  the  time  complexity  T(CBS)  =  0((m+n)k^). 

Proof: 


The  storage  tables  used  are  Fl(0:l,l;k,l:k),  F(0:l,l:k)  and  FM(l:k). 
However,  after  the  optimal  returns  from  the  branch  have  been  transferred 
to  FM,  FI  can  be  released  and  the  smaller  table  F  may  be  used.  This  means 
that  at  any  one  time  the  maximum  storage  used  will  be  no  more  than  2k^  + 
3k  spaces.  Hence,  S(CBS)  =  0(k^)*  For  time  complexity  we  list  the  time 
taken  by  each  step  as  follows: 


Step 

A 


Time 

m 


i=l 


B 


C 


ml  01 
s~l 


^  P 

s  s  01 


E  I  k  P 

i=s+l 


I  log  p.+  log  k 
i=l 

m 

I  /’‘llPll*  Pll>  ^  ‘‘ml 
1=1 
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Thus, 

m  n 

T(CBS)  =  I  (log  +  I  Pi'*’  ^iPi^ 

i=l  i=l 

‘‘st’s  ‘'^01'  “  *  “oi  *  ’'n 

<  5^  (log  k  +  k^(k  +  1))  +  ^  (log  k  +  k^)  +  k  (k  +  1)  +  k^(k  -  1) 
1=1  1=1 

+  log  k  +  k. 

=  (m  +  1)  k^  +  (ra  +  n)  k^  +  2k  +  (m  +  n  +  1)  log  k 

<  (tn  +  n)  k^  +  (m  +  n)  k^  +  2k  +  (m  +  n  +  1)  log  k 


Let  a  denote  the  right  side  to  that  T(CBS)  <  a  Now, 


a 


(m+n)  k 


=  1 


+  i  +  (- 

k  'm+n 


) 


+  (1 


+  -^) 
m+n 


The  second  terra  converges  to  0.  The  third  and  fourth  terms  are  no 
more  than  and  -  respectively.  Hence  each  converges  to  0. 

k"^  3 

Therefore,  - — r  — 1  T(CBS)  =  0(a)  =  0((ni  +  n)  k  ) 

(m+n)  k^ 

Corollary  3. 

T(CBS)  is  linearly  dependent  on  the  number  of  stages  in  the  system. 
Furthermore,  for  a  bounded  number  of  stages  T(CBS)  -  0(k  ). 


DISCUSSION 


The  algorithms  discussed  in  this  paper  and  developed  extensively  in 
[3]  have  been  extended  to  other  classical  nonserial  systems  such  as  the 
feedforward  and  feedback  loop  systems.  These  cases  are  certainly  more 
difficult  than  the  two  classical  systems  treated  here.  A  natural 
extension  of  these  ideas  is  to  complex  combinations  of  the  four  basic 
structures.  Research  is  in  progress  to  extend  these  ideas  to  multi- 
diverging  and  multiconverging  branch  systems  including  various  forms  of 
branching,  converging  and  looping.  Although  the  pattern  of  analysis  in 
such  systems  is  akin  to  the  ones  presented  here,  it  will  be  shown  that 
memory  requirements  are  functions  of  the  complexity  of  branching  present 
in  the  network. 
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A  MODEL  FOR  SYMMETRIC  VORTEX-MERGER^ 
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We  examine  the  pairing  or  merger  of  two  identical  regions  of  uniform  vorticity  using  an 
approximation  to  the  2D-Euler  equations  based  on  second  order  local  physical  space  moments. 
This  approximation,  that  can  describe  the  initial  phase  of  a  merger,  yields  a  Hamiltonian  system 
of  ordinary  differential  equations  for  the  evolution  of  the  centroid  position,  aspect  ratio  and 
orientation  of  each  region.  The  symmetry  of  the  problem  makes  this  system  integrablc. 
Thereby  we  obtain  a  necessary  and  sufficient  condition  for  merger.  This  condition  involves  only 
the  initial  conditions  and  the  conserved  quantities.  The  existence  of  "pulsating"  solutions, 
observed  computationally,  is  related  to  the  existence  of  two  steady  corotating  states  -  two  fixed 
points  in  a  phase  plane. 


1.  Introduction 

When  two  vortices  with  like  signed  vorticity  are  sufficiently  close  together  they  merge  or 
pair  into  one  vortex.  For  example,  two  identical  circular  vortices  of  uniform  vorticity  merge 
when  the  initial  centroid  separation  is  smaller  than  1.7  diameters.  In  spite  of  the  importance  of 
the  merger  process  —  especially  for  two-dimensional  turbulence  calculations  (McWilliams  (1985)) 
-  there  has  so  far  been  only  a  limited  understanding  of  why  the  process  takes  place.  In  this 

paper  we  present  the  first  simple  analytical  model  of  symmetric  merger  of  uniform  vortices. 

Melander,  Zabusky  and  Styezek  (1985)  have  developed  a  physical-space  moment  description 
for  vortex  interactions  of  the  2D-Euler  equations.  Although  this  model  is  asymptotic  and 
becomes  increasingly  invalid  during  a  meger,  it  yields  a  fair  description  of  the  initial  evolution 
towards  vortex  pairing.  The  moment  model  yields  a  Hamiltonian  system  with  four  ordinary 
differential  equations  for  each  vortex.  When  the  model  is  applied  to  the  present  problem  wc 


^  Pretenied  al  the  Third  Army  Cooference  on  Applied  Malbemalics  and  Computing.  May  t9SS-  To  be  published  in  proceedings  ol  ilie 
neeiiog. 
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obtain  an  integrable  model.  Using  this  model  we  obtain  a  simple  explicit  formula  (28)  for  the 
threshold  of  the  process. 

In  Section  2,  we  discuss  briefly  the  ideas  behind  the  moment  model  and  state  the  equations 
governing  the  interaction  of  two  identical  vortices.  The  solution  to  these  equations  is  presented 
in  Section  3  and  interpreted  through  phase  portraits  in  Section  4.  Dynamical  results  derived 
from  the  moment  model  are  validated  qualitatively  in  Section  5  by  comparing  with  high 
resolution  spectral  simulations. 


2.  Formulation  of  the  model  equations 

We  consider  two  identical  regions  and  of  constant  vorticity  [j.  The  regions  are 

symmetrically  situated  around  the  global  vorticity  centroid,  which  we  for  convenience  take  as 
the  origin  of  a  cartesian  coordinate  system.  The  dynamics  is  governed  by  the  2D-Euler 
equations,  which  in  vorticity-streamf unction  form  are 

d^o)  =  =  0.  (1) 

and 

Ai|/  =  “0).  (2) 

We  approximate  this  problem  using  the  moment  model  derived  in  Melander,  Zabusky  and 
Styczek  (1985).  For  the  sake  of  the  completeness  we  briefly  describe  the  ideas  behind  this 
model. 

The  system  of  equations  (2)  and  (1)  has  the  following  weak  formulation 

i|>(x)  =  -  (1/2tt)  /  a)(x’)  ln|x-x’|dp’,  (3) 

d^/  F(x)a)  dp  =  /(D(7F)*d^x  dp,  (4) 

where  dpsdxdy  and  F  is  an  arbitrary  test  function.  Asumming  that  and  are  disjoint 

ellipses  (of  common  area  A,  aspect  ratio  X  and  orientation  (j)  as  shown  in  Figure  1),  we  may 
express  the  streamf unction  in  terms  of  the  local  moments 

(5) 

^mn  f  m.  ,n  . 

Jjj  =  J  ID  (x-Xj^)  (y-yj^)  dp, 

*'d 

k 

where  Xj^  =  (Xj^,  y^^)  denotes  the  centroid  of  If  xeO^  then 


802 


(6) 


l/2it y ln|x-x-|ilp'=^C_^(x-Xj)J^” 


m+n=0 


is  a  convergent  series  representing  the  far-field,  while  the  near-field  of  ^  is  obtained  from 
the  properties  of  Kirchhoff’s  elliptical  vortex.  By  truncating  the  expansion  (5)  after  m+n=2  and 
restricting  the  test  functions  F  to  be  quadratic  polynomials  of  x  and  y  inside  and  D^,  we 

obtain  a  finite  system  of  ordinary  differential  equations  for  the  local  moments  and  the 
centroid  positions.  Using  the  terminology  introduced  in  Figure  1,  these  equations  become: 

d  X  =  XQA  sin  2(e>4>).  (7) 

4TrR2 

dj^4)  =  SX  +  OAd+X^)  cos  2(e-4>).  (8) 

(1+X)2  8itR2(1-X^) 

d^e  =  Ais  1 1  -  Ad-x^)  cos  2(e-4>)j.  w 

4TrR’2  SttR^X 

dtR  =  A^gd-X^)  sin  2(0-4)).  (10) 

32TrXR^ 


where  A  is  conserved.  These  equations  are  derived  in  Melander,  Zabusky  and  Styczek  (1985). 
Note  that  0  and  ^  enter  only  in  the  combination  0-4».  The  model  is  asymptotically  correct 
to  order  0{£^)  where 

e  =  max  (diameter  ^j)/(2R).  dl) 

Therefore  as  a  merger  progresses  the  model  becomes  increasingly  invalid.  However,  from  past 
experience  the  model’s  results  compare  favorably  with  results  for  the  full  Euler  equations.  That 
is  we  have  found  a  good  agreement  for  critical  merger  distances  -  in  general  the  deviation 
from  contour  dynamical  results  is  only  5-10%.  For  example  the  critical  merger  distance  for  two 
identical  circular  vortices  is  3.VA7¥  for  contour  dynamics  and  3.2/ ATi  for  the  moment 
model. 

Equations  (7)  through  (10)  conserve  the  total  angular  impulse  or  moment  of  inertia 

M  =  2Aw  [R2  -f  A(l+X^)/(4TrX)].  (12) 


and  the  excees  energy 

H  =  1/2  /  (Dip  dp. 
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(13) 


The  first  follows  by  integration  of  (7)  and  (10).  The  second  follows  from  the  truncation  of  the 
expanded  streamf unction. 


The  system  (7)-(10)  becomes  Hamiltonian  when  canonical  variables  are  introduced.  However, 
we  shall  not  use  these  variables  in  this  paper. 

We  make  the  equations  dimensionless  by  introducing  the  following  scaling 

1  /? 

t  =  m,  /?  =  R(Tr/A)  . 


The  dimensionless  centroid  separation  s  becomes  2R(7r/A)^^^.  and  we  introduce 

K  =  l/s2  =  A/(47rR2).  (15) 

A  convenient  normalization  of  M  is 

0  =  27rM/a)A^  >  2,  (Ig) 

which  may  be  written  as 

1/K  =  o  ~  (1+X^)/X.  (17) 

If  C=4>"6  then  we  can  rewrite  the  system  as  two  first  order  differential  equations 

d^X  =  -XK  sin  2<,  (18) 


where 


=  X/(X+1)^,  (20) 

=  (K/2){[_X2^+ K(X^)]  cos  2?  +  2  }. 

X^-l  ~X 


Here  and  ^  are  the  angular  velocities  due  to  the  self-interaction  and  the  mutual- 
a  in 

interaction,  respectively. 
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3.  The  solution  of  the  model  equations 

With  (17)  inserted  into  (18)  and  (19)  we  obtain  an  autonomous  system.  Hence  for  a  fixed 
value  of  a,  a  conserved  quantity,  the  problem  is  within  the  framework  of  a  classical  phase 
plane  analysis. 


Obviously  there  is  a  singularity  in  the  equations  for  X=l,  which  is  due  to  the  way  we 
describe  the  ellipse,  namely  by  an  aspect  ratio  and  an  orientation.  (Oearly  the  orientation  is 
not  well-defined  for  a  circle).  There  are  other  bad  features  of  the  (X.4))~description  as  well. 
For  example  the  same  ellipse  can  be  described  in  many  ways.  (X,<1)),  (l/X,4)+ir/2),  etc.  A 
convenient  set  of  coordinates  for  the  problem  is 


(D,G)  =  (X_M)  (cos  2^,  sin  2f;)  (21) 

X 

whereby  each  ellipse  is  uniquely  represented.  Furthermore,  the  singularity  at  X=1  disappears 
when  the  equations  are  restated  in  the  new  variables 

1/K  =  0  -  /  4  D2  +  G2,  (22) 


M 

/ 

I  0  \ 

V 

/  K(2+KD)  -  2  \  1 

Ir 

\  2+A^D^G^  /  1 

[ 

w/ 

\-dI 

' 

\  K/4+D2  +g’2  / 

In  terms  of  the  new  variables  (D,G)  the  excess  energy  is 

H  =  DK/2  -  ln((2  +  /4+D2-^G^)/K).  (24) 


The  trajectories  in  the  (D,G)-plane  are  the  level  curves  of  H. 


4.  The  merger  condition 

The  natural  way  to  interpret  the  results  is  by  looking  at  the  (D,G)  phase  plane.  There  is  a 
1/2 

circle  of  radius  (o^-4)  and  center  (0,0)  in  the  phase  plane  where  (d^D)*+(d^G)^  is  infinite. 

Along  this  circle  the  centroid  separation  vanishes,  K  ^=0,  Since  R=0  corresponds  to  centroid 
"collapse",  we  name  this  circle  the  collapse  circle.  Only  that  part  of  the  phaseplane,  which  is 
inside  the  collapse  circle  is  of  physical  interest.  The  moment  model  is  valid  within  a  certain 
disk  centered  at  (0,0)  and  with  a  radius,  s3.75  corresponding  to  an  aspect  ratio  X=4. 
Equations  (23)  and  (24)  are  reflection  symmetric  about  the  D-axis,  corresponding  to  time 
reversal  invariance.  Furthermore,  we  find  that  the  aspect  ratio  decreases  when  G<0  and 
increases  when  G>0. 

The  steady  state  solutions  of  (23)  have  been  reported  in  Melander,  Zabusky  and  Styczek 
(1985).  The  aspect  ratios  of  the  steady  states  are  the  real  roots  of  a  fifth  order  algebraic 
equation 
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4X®  -  7aX''  +  (2o^+o+6)X^  - 


(2o2+3o)X^  +  5oX  -8  =  0. 


(25) 


(In  the  previous  work  this  equation  was  expressed  in  terms  of  the  dimensionless  centroid 

1/2  -1 
separation  ■n=2R(rr/A)  ).  Figure  2  shows  the  reciprocal  aspect  ratio  X  ,  of  the  states  as  a 

function  of  o  There  are  one,  two  or  three  steady  states  depending  on  whether 

a<a  or  0^0  ■  All  of  them  are  located  on  the  E>-axis.  A  local  analysis  shows 

Cix  cr  cr 

that  6  is  a  saddle  point  and  A  is  a  center.  C  corresponds  to  a  solution  outside  the  colapse 
circle,  that  is  without  physical  meaning.  Let  us  consider  the  three  cases. 

(A)  a<a  .  Since  there  are  no  steady  states,  therefore  all  initial  conditions  lead  to  centroid 

cr 

collapse  or  merger.  A  typical  phase  portrait  is  shown  in  Figure  3.  The  two  apparent  stationary 

points  on  the  collapse  circle  show  that  centroid  collapse  always  takes  place  at  an  angle  of  45° 

in  the  corotaling  frame.  However,  these  points  are  far  outside  the  region  where  the  niodel  is 

valid. 

(B)  0=0  .  In  this  case  we  have  one  unstable  steady  state.  All  other  initial  conditions  lead 

cr 

to  merger.  We  remark  that  the  fixed  point  does  not  correspond  to  the  limiting  "figure  eight" 
state  of  Saffman  and  Szeto  (1980). 

(C)  o>o  •  This  is  the  most  complicated  and  interesting  case.  When  o  is  only  moderately 

cr 

larger  than  o  ,  we  have  two  physically  significant  stationary  points.  We  observe  from  the 
cr 

phase  portrait  shown  in  Figure  4  that  there  is  a  critical  separatrix  S  starting  and  ending  at 
saddlepoint  B.  All  initial  conditions  outside  S  lead  to  centroid  collapse.  Inside  S  we  have 
closed  orbits  and  the  center  A.  Along  a  closed  orbit  surrounding  the  origin,  ^  increases 
steadily,  so  that  vortices  rotate  counter  clockwise  in  the  corotating  frame.  However,  orbits  not 
surrounding  the  origin,  ^  oscillates  around  zero,  corresponding  to  a  nutation  of  ellipses  in  the 
corotating  frame.  Since  H  has  a  constant  value  on  each  trajectory  in  the  phase  plane,  we  can 
characterize  S  by  the  corresponding  value  of  H  at  point  B,  namely 

H^(o)  =  (X^-l)/[2(aX-X^+l)]  (26) 

-  ln(2  +  /X^+l)/X  ) 

+  ln(a  -  (X^+H/X)  . 

A  Ag 

where  X^  denotes  the  aspect  ratio  of  the  unstable  steady  state  B.  The  region  of  the  phase 
plane  inside  S  is  then  characterized  by 

H(D,G:o)  <  H^(o)  and  X  <  Xg(o).  (27) 

(Note,  H<H^(o)  between  the  outer  separatrices  and  to  right  of  B).  Hence,  merger  or  centroid 
collapse  will  occur  if  and  only  if  one  of  the  following  conditions  is  satisfied; 
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(28) 


H  >  H^(o); 

\  >  Xg(o). 

This  is  the  merger  condition  of  the  moment  model  for  symmetric  merger. 

Now  that  we  know  the  phase  portraits  we  shall  address  the  physical  question  of  why  merger 

occurs  in  the  moment  model.  Consider  the  case  o^o  The  moment  model  gives  reasonable 

cr 

results  only  when  X  is  moderate,  say  less  than  four.  Herewith  we  have  reduced  the  interesting 
part  of  the  phase  plane  to  a  disk  centered  at  the  origin.  All  trajectories  inside  this  disk  are 
either  clearly  inside  the  critical  separatrix  S  or  they  come  near  the  saddle  point  B.  A  local 
analysis  near  the  saddlepoint  is  therefore  justified.  Note  that  a  state  in  the  neighbourhood  of 
B  is  easily  to  recognized  from  the  vorticity  distribution,  because  the  ellipses  are  aligned  (^=0) 
and  fairly  elongated  X>niin  {Xni“2.36.  The  separatrices  divide  this  region  of  the  phase  plane 

into  four  sectors,  labelled  in  Figure  4.  Table  1  presents  simple  rules  for  determining  the  long 
time  evolution  (nonlinear  stability)  from  a  short  time  evolution  (linear  stability  of  the  saddle 
point  B).  That  is  one  sees  the  long  time  evolution  from  less  than  half  a  revolution  in  the 
corotating  frame. 

Table  1 


Sector  Characteristic 


Long-time  evolution 


1 

2 

3 

4 


d^5>0 

dttC^O.  d^C“0 

d^5<0 


pulsation 

merger 

merger 

merger  after  one  pulsation 


From  equation  (19),  we  observe  that  d^/dt  becomes  negative  because  the  clockwise  rotation 

caused  by  the  mutual  interaction  dominates  the  counter-clockwise  rotation  of  the  Kirchhoff 

ellipse.  In  Melander,  Zabusky  and  McWilliams  (1985)  we  discuss  the  consequences  of  12  >12 

m  s 

for  the  full  Euler  equations.  Especially  we  show  that  12  >12  can  cause  filamentation  and 

m  s 

axisymmetrization. 


S.  Oualitgtive  comparison  with  high  resolution  simulations 

In  this  section  we  discuss  spectral  simulations  related  to  the  critical  separatrix  5  in  the 
phaseplane.  We  do  this  to  validate  the  conclusions  obtained  from  the  moment  model  and  to 
highlight  the  importance  of  the  separatrix  S  as  a  threshold  to  merger. 
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The  simulations  are  made  with  a  pseudospectral  code  (Haidvogel  (1985))  which  solves 


-  3yTp3^to  =  -v^A^w 


(29) 


(i)  =  “  All) 

in  the  periodic  domain  [— ji.n]  x  [—rr.u] . 


(30) 


We  specify  the  initial  vorticity  distribution  w(r,4),0)  as  an  idealized  smooth  distribution  of 
compact  support,  and  with  equivorticity  lines  that  are  concentric  ellipses  of  a  common 
orientation  and  aspect  ratio.  Outside  the  ellipse  there  is  no  vorticity  and  inside 

r=/?.(4))  the  vorticity  is  uniform  w=Wp-  The  realative  steepness  of  the  vorticity  gradient  is 

controlled  by  the  parameter  S-(F^-R^/R^.  We  refer  to  such  a  vorticity  distribution  as 

V(6,u  ,a,b),  where  a  and  b  are  the  major  and  minor  axis  of  the  outermost  ellipse  r=/7  (6).  We 
p  o 

have  found  it  convenient  to  specify  to(r,4),0)  as  a  distribution  with  a  monotone  profile  function 

f(r).  r>0 


(31) 


w(r,(l»)  = 


(■ 


t<R. 

1 

R. 


WA  1  -  f  [(r-/?.)/(/7^-/?.)] ,  /?.  <  r  < 

p]  10  11  0 

R  <r. 

0 


We  select  our  profile  function  f  from  the  one  parameter  family  {fj^:k>0}  where 

fj^(r)  =  exp(  -k/r  exp(l/(r-l)  ),  0  <  r  <  1.  (32) 


This  function  smoothly  connects  levels  0  and  1  at  r=0  and  r=l,  and  all  its  derivatives  vanishes 
at  these  points.  A  suiteable  k  is  obtained  from  the  natural  requirement  that  f(0.5)=0.5,  this 
implies  k=2.56085.  With  this  choice  of  k  we  find  f’(0.5)=ln  8  =  2.08,  and  approximately  90%  of 
the  variation  of  the  function  f  occurs  within  the  interval  [0.25, 0.75]. 


Figure  5  shows  a  numerical  simulation  of  the  evolution  of  two  vortex  regions  as  calculated 
on  a  256^-mesh.  The  initial  conditions  are  small- 6  near-lophats  of  elliptical  shape  -  V(0.2, 
10.,  0.6255,  0.4)  -  conesponding  nearly  to  a  corotating  steady  state  solution  Overman  and 
Zabusky  (1982)  for  o=12.66.  By  trial  and  error  we  adjusted  the  centroid  separation  to  1.44 
which  is  slightly  smaller  that  the  critical  merger  separation. 

The  two  vortices  approach  and  at  t=1.0  the  lower  contours  have  joined.  We  observe  that 
the  state  barely  changes  it  shape  in  the  time  interval  from  t=1.0  to  t=3.0.  During  this  period 
the  state  looks  like  the  unstable  steady  state  (B)  predicted  by  the  moment  model.  Note  the 
slow  clockwise  rotation  of  the  individual  vorticies  with  respect  to  a  corotating  frame  (from 
t=1.0  to  t=2.5).  After  t=2.5  we  can  no  longer  distinguish  the  individual  vortices,  since  the 
dissipation  has  erased  the  gradient  between  them. 
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For  t>3.0  the  compound  core  begins  to  axisymmetrize.  We  display  pictures  of  this  evolution 
because  they  constitute  a  unique  example  of  the  evolution  after  a  merger  of  nearly  uniform 
vortices  (comparable  with  contour  dynamics)  and  highlights  the  complicated  entanglement  of 
Tong  filaments.  In  fact  this  simulation  goes  beyond  what  has  been  calculated  with  contour 
dynamics.  At  t=5.0  and  t=6.0  we  see  a  significant  vorticity  shedding  and  the  formation  of 
strong  filaments.  The  almost  complete  absence  of  the  "roll-up"  phenomenon  is  due  to  the 
smooth  vorticity  distribution  and  the  dissipation,  but  has  nothing  to  do  with  the  periodic 
boundary  conditions  (actually  the  periodic  boundary  conditions  are  in  favour  of  the  roll  up). 
At  t=8.0  the  vorticity  shedding  has  stopped  and  a  stable  near  elliptical  core  has  formed. 
However,  the  presence  of  the  filaments  has  a  dramatic  influence  on  the  core,  particularly  when 
they  reattach  to  the  core.  The  figures  of  the  evolution  at  t=9.0,  i=9.5  and  t=10.0  represent  our 
cleanest  example  of  a  reattachment.  The  evolution  beyond  t=10.0  has  not  been  calculated,  but  is 
within  the  framework  of  the  axisymmetrixation  of  a  single  vortex  (Melander,  McWilliams  and 
Zabusky  (1985)).  We  expect  relaxation  to  axisymmetry  through  a  couple  of  further  breakings. 

While  Figure  5  corresponds  to  a  trajectory  slightly  outside  the  critical  separatrix  S  in  the 
phasep^ane.  Figure  6  corresponds  to  a  trajectory  slightly  inside  S.  This  figure  is  the  result  of 
a  128  -mesh  calculation.  We  start  out  with  the  same  initial  conditions  as  before  only  the 
centroid  separation  is  now  increased  to  1.45.  Except  for  the  obvious  effect  of  the  dissipation 
on  the  vorticity  gradients  the  evolution  until  t=2.0  is  the  same  as  before.  Later  it  becomes 
clear  that  the  vorticies  rotate  counter  clockwise  in  the  corotating  frame,  compare  t=5.0  and 
t=6.0.  In  the  time  interval  from  t=2.0  to  t=5.0  we  observe  an  almost  steady  state  corresponding 
to  the  saddlepoint  in  the  phase  plane.  At  t=7.0  this  state  begins  to  break  into  two  vortices 
and  at  t=8.5  only  the  two  lowerst  contours  are  joined.  The  vortices  appraoch  again  a  t=9.0  and 
at  t^lO.O  we  observe  a  near  recurrence  to  the  steady  state. 


6.  Conclusion 

We  have  presented  an  integrable  model  governing  the  evolution  of  two  identical  vortices. 
The  model  yields  an  explicit  merger  condition  (28),  which  involves  both  conserved  quantities 
and  initial  conditions.  For  the  most  frequently  occuring  initial  conditions  -  namely  almost 
axisymmetric  vortices  -  only  the  conserved  quantities  are  needed.  The  threshold  to  merger  in 
the  moment  model  is  a  separatrix  in  a  phase  plane.  Using  specially  chosen  initial  conditions  for 
high  resolution  spectral  simulations  we  have  demonstrated  a  similar  phenomenon  for  the  full 
Euler  equations.  Especially  we  have  shown  that  the  threshold  to  merger  is  a  balance  between 
mutual-  and  self-interaction  of  the  vortices.  This  provides  us  with  a  rule  of  thumb;  If  the 
vortices  -  wfwn  they  are  aligned  -  have  an  aspect  ratio  larger  than  2.3  and  rotate  clockwise 
in  the  corotating  frame  then  merger  will  occur  -  otherwise  not. 
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Figure  1  shows  two  elliptical  vortices  symmetrically  situated  around  the  common  vorticity 
centroid.  The  centroid  of  vortex  1  has  the  polar  coordinates  (R,0)  and  the  major  axis  is  tilted 
the  angle  4).  In  a  frame  rotating  with  angular  velocity  d^0  the  orientation  of  the  major  axis  is 

C- 
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-  n  I  I  I 

0.00  0.05  0.10  0.15 

1/(7 

— ^  aspect  ratio  of  the  steady  corotating  states  of  the  moment  model  is  plotted 

versus  the  dimensionless  conserved  quantity  o=27rM/(jjA.  A( _ )  are  centers  and  B( - )  are 

saddlepoints.  For  these  states  (A  and  B)  the  major  axis  of  the  vortices  are  aiignad  For  a 
given  value  of  o  the  aspect  ratio  X  must  be  smaller  than  [o+/o^]  /2  in  order  to  have  a 
non-vanishing  cetroid  separation,  the  unphysical  region  where  this  condition  is  not  satisfied  is 
shaded  in  the  figure.  The  polynomial  (25)  has  one  real  root  C  in  this  region.  The 

figure  also  shows  results  (o)  from  the  contour  dynamical  calculations  of  Overman  and  Zabuskv 
(1982). 
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Figure  3  The  phase  plane  for  o=10.0<o  .  We  have  scaled  the  axis  by  the  collapse  radius 
—  cr 

0^^9.79. 
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Figure  4  The  phase  plane  for  o=12.5>o 


cr 


11.42. 
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Figure  5  Equivorticity  lines  for  the  evolution  of  two  V(0.2,  10.,  0.6255,  0.4)  vortices  on  a 

256  -mesh  with  v  =3.125x10  .  The  initial  center-to-center  distance  is  1.44  and  the  box  size  is 
4 

2ir.  We  have  subtracted  the  bulkrotation  in  the  pictures  since  it  is  irrelevant  for  our  discussion. 
The  contour  interval  is  0.6. 


t=0.0  t=9.0 


Figure  6  shows  the  evolution  of  the  same  two  vortices  as  in  Figure  5  but  with  the  centroid 
separation  1.45.  The  calculation  is  performed  on  a  128^ -mesh  and  the  dissipation  is  16  times 
strwiger  as  dictated  by  the  mesbsize.  Therefore  the  gradients  quickly  becomes  more  smooth 
than  in  Figure  5.  The  coarser  mesh  is  justified  by  computational  considerations.  In  order  to 
observe  near  recurrence  to  the  unstable  steady  state  the  initial  centroid  separation  must  be 
sligthly  smaller  than  the  critical  merger  separation.  This  distance  can  only  be  found  to  high 
accuracy  through  bisection  -  a  wasteful  procedure.  We  found  this  distance  on  the  128  ^ -mesh 
to  three  significant  figures. 
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Abstract;  We  develop  stable  methods  for  correct  summation  of  expansions  of  data 
perturbed  by  error,  using  eigenfunctions  associated  with  elliptic  operators.  The  tech¬ 
nique  parallels  that  of  Tikhonov  for  Sturm-Liouville  operators  in  one  dimension,  using 
summation  techniques  which  scale  the  summation  parameter  with  error.  Our  analytic 
methods  recover  from  the  perturbed  expansion  a  good  approximation  to  the  correct 
data,  where  the  data  are  sufficiently  regular. 

Introduction:  Some  problems  of  engineering  and  mathematical  physics  can  be  formulated 
as  that  of  solving  the  equation 


Au  =  f,  f&F 

where  A  is  an  operator  mapping  a  metric  space  U  into  a  metric  space  F.  These  problems 
are  called  well-posed  in  the  sense  of  Hadamard,  if 

(a)  there  exists  at  least  one  solution  u; 

(b)  there  exists  at  most  one  solution  u; 

(c)  there  is  continuous  dependence  of  solutions  {u}  upon  the  data  {/}. 

This  last  condition  is  called  stability.  A  classical  example  of  a  problem  which  does 
not  satisfy  (c)  is  the  Cauchy  problem  for  the  Laplace  equation.  Another  is  the  heat-flow 
equation  for  negative  time,  namely 


a^Uxz  =  ut 

u{x,t)  —  f{x)  ,  i  <T 

This  problem  is  unstable  under  small  changes  in  the  data  /,  in  that  small  mean  square 
errors  in  /  produce  large  errors  in  u,  both  point  wise  and  in  U'. 

One  method  of  resolving  such  ill-posedness  is  the  introduction  of  additional  a  priori 
assumptions  on  the  solution  w.  The  Russia.n  mathematician  A.N.  Tikhonov  [Til],  in  a 
fundamental  paper,  applied  this  idea  to  solution  of  ill-posed  inverse  problems.  It  was 

*  Research  partially  supported  by  the  National  Science  Foundation 
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successfully  used  on  the  Cauchy  problem  for  elliptic  operators  in  pioneering  work  by  Fritz 
John  (Jolj.  [Jo2]. 

This  paper  presents  a  regularization  method  for  expansions  in  eigenfunctions  of  elliptic 
operators  on  R" .  The  following  example  illustrates  a  situation  to  which  our  results  can  be 
applied.  If  A  is  an  elliptic  operator  in  the  variable  x  £  R”,  the  partial  differential  equation 

Qk 

=  A%Ij[xA)  (la) 

on  (x.t)  €  R”  X  R~  can  be  viewed  as  a  dynamical  system.  Its  normal  modes  are  multiples 
of  the  eigenfunctions  {u£(x)}  of  .4  and  the  solution  can  be  written  as 

=  (16) 

i 

where  time  evolution  of  the  dynamical  variables  a/(^)  is  governed  by 

d* 

^Of(f)  =  A/0/(f), 

with  \i  the  eigenvalue  corresponding  to  Uf(x).  The  above  summation  schematically  de¬ 
scribes  a  discrete  or  continuous  spectral  expansion.  Equations  such  as  (1)  model  such 
system  as  the  flow  of  heat,  and  the  time  evolution  of  relativistic  quantum  mechanical 
and  electromagnetic  fields.  Given  the  relationship  between  the  “normal  mode”  dynamics 
and  the  “pointwise”  dynamics  expressed  in  (1),  a  natural  question  is  how  perturbations 
in  configuration  space  (in  which  a(t)  =  (01,02,...)  lies)  correspond  to  changes  in  t/;(x,t), 
and.  more  specifically,  how  such  changes  in  il>(x,  t)  can  be  stably  computed.  This  is  clearly 
a  question  about  pointwise  behavior  of  perturbed  expansions  in  eigenfunctions  of  elliptic 
operators. 

Mathematically,  our  regularization  method  is  a  family  of  linear  operators  {0«}«>o  on 
L^fR”)  along  with  a  scaling  function  ^(7),  7  >  0,  such  that  if  {/■y},  7  >  0  is  a  net  of  I/P(R”) 
perturbations  of  /  with  ||/-i  —  /  ||p  <  7,  then  4>i{fy)  approaches  /  in  and  pointwise  when 
e  =  €(7)  and  7  — »  0.  The  results  we  obtain  are  applicable  to  the  class  of  analytic  multipliers 
<Pj(/)  =  <f)[(.A)f  ^  where  A  is  an  elliptic  operator  on  R”.  Such  regularization  methods  for 
expansions  in  eigenfunctions  of  ordinary  differential  operators  have  been  widely  studied. 
Results  for  partial  differential  operators  are  fewer.  Our  results  apply  to  a  large  class  of 
elliptic  operators  on  R",  and  can  be  used  for  both  discrete  and  continuous  spectra.  We 
use  extensively  the  analytic  operator  calculus  (see  [DS]),  and  the  results  and  approach  of 
[GKl]  and  [KR]. 

II.  Previous  Results  and  Problem  Formulation. 

Let  Q  =  (qj,  . . . ,  Q„J  be  a  multiindex,  and  I?*  =  (0~'*'^pr  •  •  • 

|a|  =  Qi  +  ...  +  art.  Consider  the  differential  operator 

A=  Y,  l>MD‘  =  A„  +  B  (2) 

ai<m 


820 


on  R”,  where  Ao  contains  the  leading  terms  and  B  is  the  remainder.  We  assume  Aq  is 
constant  coefficient  positive  elliptic,  while  the  coefficients  of  B  can  be  expressed  as  sums  of 
functions  in  certain  Z/'’ spaces:  b^ix)  6  (|q|  <  m)  where  </  =  sup|(^l<^{^+la|}  < 

m.  We  choose  for  the  domain  of  A  the  L^-Sobolev  space  ZJm  =  (l  —  A)“tLP,  1  <  p  < 
minr^^;  if  p  is  outside  this  range,  then  the  domain  must  be  smaller  and  A  may  not  be 
densely  defined.  The  resolvent  [z  —  of  A.  if  it  exists,  may  be  expressed  as  an  integral 
operator  with  kernel  denoted  by  Lg[x^  y),  x,  y  €  R”.  Sharp  local  bounds  on  the  behavior 
of  Lg  were  obtained  in  [GK2].  Specifically,  let 

h  -  I  if  s  >0,  -ln|x|  ifs  =  0;  |x|  <  1  ,  . 

^  ~  \  |x|-^  |x|  >  1 

The  next  theorem  says  that  the  kernel  of  the  resolvent  operator  is  bounded  by  an 
radially  decreasing  convolution  kernel. 


Theorem  1  (Gurarie,  Kon):  Let  1  <  p  <  min  r^.  There  is  a  constant  C  >  0  such  that  the 

LP-speetrum  of  A  is  contained  in  the  parabolic  domain  fl  =  {x  =  pe*^:  Cpm~^  >  |  sin 
containing  R"*".  For  z  ^  U  the  kernel  of  Rg  =  (x  —  is  estimated  by 

\Lz{x,y)\  <  F{p,0)p^-^h,^t{p^{x-  y)),  (4) 

where  t  >  n,  s  =  max(0,  n  —  m)  and 


F{z)^F{p,d)  = 


sin 


1  - 


Cp 


^-1  \-i 


sin 


(5) 


Furthermore,  the  kernel  of  Rg  —  R2,  where  Rz  =  {z  —  .Aq)  satisfies 


L^z\x,y)  < 


4±n 


Sin 


9.  |n+2 


F[p,B)p  « 


-2 


hs',t[p''^[x-y)). 


with  s'  —  max(n  —  2m  +  d,0). 

We  now  define  our  class  of  regularization  operators,  which  are  expressed  in  terms  of 
an  analytic  multiplier  <t>{z)  with  <^(0)  =  1.  Let 

B-y  =  {2: 1  argx|  <  Tf},  Z?r  =  {2:  |2|  <  r}  (6) 

By  Theorem  1,  the  spectrum  o{A)  is  contained  in  U  Dr  for  r  sufficiently  large.  Let 
(^(2)  be  analytic  in  a  domain  D  C  C  containing  B^  U  Df^  and  F  be  the  positively  oriented 
contour  consisting  of  the  boundary  d[Dr  U  B^).  Using  the  analytic  operator  calculus  we 
can  define  the  operator  I/^(R")  — ^  L^(R”)  by 

d>{eA)f{x)  =  -  eA)^^/(x)dz, 
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where  e  is  small  enough  that  the  spectrum  of  cA  lies  within  F. 

To  guarantee  (l>{eA)  is  well  defined  we  suppose 

(a) 

/  i  ^  <  0®  for  ray  =  {z:  arg  z  =  71},  |7i|  <  7 

(7a) 

(&) 

(76) 

For  our  regularization  methods,  we  must  impose  the  following  stronger  integrability  con¬ 
dition: 

(8) 


for  all  6  >  0. 

It  is  interesting  to  note  the  connection  of  the  regularization  operators  with  the 

application  of  multipliers  for  summing  expansions.  Formally, 

A  A 

with  ux  a  generalized  eigenfunction  of  A  and  A  its  associated  eigenvalue.  The  operator 
formulation  on  the  left  side  is  more  general,  in  that  it  can  be  applied  independently  of 
detailed  spectral  considerations. 

111.  Main  Result  and  an  Application 

We  can  now  formulate  our  problem.  Let  /  €  L^(R”),  1  <  p  <  oo,  and  {/^},  "7  >  0, 
denote  a  net  of  functions  in  1/(R”)  with  ||/  —  f^\\p  <  7.  We  wish  to  determine  a  sharp 
scaling  of  €  and  7  which  guarantees  that  as  7  ^  0,  <i>{tA)f^  f  in  LP(R'*),  1  <  p  <  00, 
and  pointwise  on  the  Lebesgue  set  of  /. 

The  following  theorem  provides  a  large  class  of  LP-regularizing  operators  of  the  form 
<t>{€A)  and  ,a  sharp  condition  on  the  associated  scaling  functions. 

Theorem  2:  Let  A  be  an  elliptic  operator  of  order  m  on  R"  satisfying  the  conditions 
above,  and  4>{z)  be  analytic  on  the  spectrum  of  A,  satisfying  the  boundedness  conditions 
(7)  and  (8).  Let  p  >  ^,  /  €  1^(R"),  and  {f^},  7  >  0,  6e  a  net  of  I,p(R'*)  functions,  with 

11/  -  All,  <i.if 

(®) 

then  tj>{eA)f^  — >■  /  in  I/^(R'*)  (1  <  P  <  00)  and  pointwise  on  the  Lebesgue  set  of  /. 
Furthermore,  this  scaling  of  c  with  7  is  sharp  in  that  if  (9)  fails,  then  for  each  A  there  are 
f,{f-f}  such  that  <l>{tA)f^~l-^ f  on  the  Lebesgue  set  of  /. 

The  proof  uses  the  fact  that  for  f  m  IP,  1  <  p  <  00,  4>{eA)f{x)  converges  as  c  — ^  0 
to  /(x)  in  IP  norm  (1  <  p  <  00)  and  pointwise  on  the  Lebesgue  set  of  /  (1  <  p  <  00); 
it  also  depends  on  Theorem  1,  the  Minkowski  integral  inequality,  and  Young’s  inequality. 
The  proof  is  omitted  for  brevity,  and  will  appear  elsewhere. 
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As  an  application,  consider  a  function  f{t)  in  a  multiparameter  time  domain  {t  — 

,  tn))^  ^ 

^(^'1  =  7^1  /(=)*■'"<*  =  ■  (10) 

(27r)7  Jr”- 

How  do  frequency  domain  errors  (in  propagate  into  time  domain  (/( t ))  errors?  In 

general,  small  function  space  or  pointwise  errors  in  /  will  lead  to  unbounded  errors  in 
pointwise  estimation  of  /. 

This  type  of  problem  has  applications  in  many  types  of  situations;  one  example  is  the 
problem  of  waves  radiating  from  an  antenna  with  fixed  frequency  u,  and  a  given  aperture 
illumination  f(x).  Small  perturbations  in  f{x)  can  lead  to  unbounded  errors  in  the  local 
radiation  field  intensity. 

The  following  provides  a  regularization  procedure  ibr  ameliorating  the  large  pointwise 
errors  in  f{t)  above;  we  choose  to  work  with  for  simplicity. 

Corollary  2.1.  Let  be  an  -perturbation  of  f,  of  size  i.e.,  \\  f  —  f^\\2  < 

(27r)  3  -/R” 

where  <i>  is  a  function  analytic  in  a  region  U  (7,  r  >  0;  see  eq.  (6)),  and  satisfies 
(7)  and  (8).  Let  f{t)  be  the  inverse  Fourier  transform  of  f.  If  the  sealing 

(12) 


holds,  then  the  pointwise  error  in  f^  .j{  t )  vanishes  as  e  0. 

That  is,  if  the  summation  parameter  e  is  scaled  correctly  with  £^-error  'v,  pointwise 
error  in  the  time  domain  vanishes  with  function  space  error  in  the  frequency  domain.  We 
remark  that  the  exponent  —  ^  in  (12)  arises  from  the  fact  that  the  expansion  in  (10)  is  in 
fact  in  eigenfunctions  of  the  Laplacian.  That  is, 

/fiTf  ~ 
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ON  THE  USE  OF  PIECEWISE-POLYNOMIALS  FOR  THE  APPROXIMATION 

OF  CAUCHY  SINGULAR  INTEGRALS 
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Rutgers  University 
New  Brunswick,  NJ  08903 


ABSTRACT:  In  this  paper  we  propose  piecewise-polynomial  methods  for  the 

approximation  of  Cauchy  principal  value  integrals  and  develop  a  simple ^  efficient  and 
numerically  stable  algorithm  for  the  evaluation  of  the  weights  of  the  resulting  piecewise- 

polynomial  quadratures.  We  present  two  examples  to  illustrate  the  advantages  of  these 
quadratures  versus  the  Gauss-Jacobi  quadratures, 

1,  INTRODUCTION 

In  this  paper  we  consider  the  numerical  approximation  of  the  Cauchy  principal  value 

integral 

/(?;«)=  /  =  lim  { /  +  /  (1) 

J^l  t  —  S  £  — »  U  J_i  y*+£  t  —  8 

where  ^(t)  is  a  Holder  continuous  function  in  |-1,  1]  and  w(t)  is  a  positive  integrable 
weight  function  given  by 

w(t)  =  (i-t)-“(i+r)-^  (2) 

where  a  and  ^  are  constants  and  a,  ^  <  1.  There  has  been  an  increasing  interest  in  the 
numerical  approximation  of  (1),  particularly  since  such  approximations  may  be  used  to 

solve  the  Cauchy  Singular  Integral  Equation  (CSIE): 

a(«)tti(«)g(<)  +  — =  /(«),  |<|<1  (3) 

It 

where  «(«),  6(»)  and  /(«)  are  given  input  functions  and  <;(<)  is  the  unknown  function. 
CSIE’s  arise  in  areas  such  as  aerodynamics,  fluid  and  fracture  mechanics,  wave-guide 
theory,  scattering  and  other  areas  (e.g.  Erdogan  et.  al.  [6],  Gerasoulis  [8],  Lots  [12],  Miller 
and  Keer  [13]). 

The  majority  of  numerical  methods  proposed  for  (1)  are  global  methods,  usually  based  on 
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orthogonal  polynomial  approximations  (e.g.  Paget  and  Elliott  []4j,  Erdogan  et.  ah  [6], 

Welstead  [17]).  Very  little  attention  has  been  paid  to  local  methods,  mainly  piecewise- 
polynomial  approximations,  even  though  such  methods  were  used  as  early  as  1950  for  the 
special  case  =  1  (e.g.  Lotz  [12|,  Ivanov  [lOi). 

It  is  well  known  that  global  methods  converge  very  fast  for  differentiable  input  functions 
with  ‘‘small”  derivatives.  However,  there  are  two  difficulties  associated  with  their 

implementation.  First,  although  orthogonal  polynomials  exist  for  the  approximation  of  (l) 
(Elliott  [5]),  similar  polynomials  may  not  exist  for  (3),  since  nonclassical  weight  functions 
might  arise  for  certain  6(5)  (Welstead  [17,  p.  115]).  Only  for  special  cases  of  b{s)  the 

existence  of  orthogonal  polynomials  has  been  proven  (Elliott  [5]).  Even  if  such  polynomials 

exist  for  (3),  considerable  computational  effort  is  needed  to  generate  their  recursion 
coefficients  (e.g.  Welstead  [17,  p.  115]).  Second,  since  the  node  points  are  generally 
chosen  as  the  zeros  of  an  orthogonal  polynomial,  global  methods  are  not  appropriate  for 
integrals  with  input  functions  that  behave  “badly”  in  some  subinterval  [a,  b]  of  [-1,  1]  (e.g. 
Paget  and  Elliott  [14,  pp.  381-384]).  For  such  integrals,  a  numerical  method  with  no 
restriction  on  the  choice  of  node  points  would  have  to  be  used  in  order  to  concentrate  the 
nodes  in  [a,  fcj.  This  is  not  possible  with  global  methods. 

In  contrast,  local  methods  based  on  piecewise-polynomial  quadratures  afford  a  flexible  choice 
of  the  node  points.  However,  their  implementation  suffers  from  the  lack  of  efficient 
numerical  algorithms  for  the  computation  of  the  quadrature  weights.  In  this  paper  we 
address  this  issue  and  propose  a  simple,  efficient  and  numerically  stable  method  for  the 
computation  of  the  quadrature  weights. 

In  section  2,  we  develop  a  quadrature  for  /(^;^),  by  approximating  g{t)  with  piecewise- 
linear  polynomials.  We  also  develop  an  algorithm  for  the  evaluation  of  the  quadrature 
weights  by  splitting  the  interval  [-1,  l]  into  two  subintervals  [-1,  O]  and  [0,  l],  and  in  each 
subinterval,  we  expand  the  nonsingular  parts  (1--*)“^  and  (1-1-*)“^  into  Taylor’s  series  and 
integrate  the  singular  parts  of  «;(«)  “exactly”.  The  quadrature  weights  are  computed  via 
two  fast-converging  series.  Finally  in  section  3,  two  numerical  examples  are  presented.  In 
the  first  example  we  extend  the  piecewise-linear  quadrature  to  weakly  singular  integrals  of 

the  form  u;(«)j/(t)c/i,  where  uj(f)  is  defined  in  (2).  We  illustrate  the  advantages  of  these 

quadratures  versus  the  Gauss-Jacobi  quadratures  by  choosing  g(t)  —  >/i7|  and  also  describe 
how  the  analysis  for  piecewise-linear  quadrature  may  be  extended  to  higher  order 
approximations  (e.g.  piecewise-quadratic,  etc.).  In  the  second  example  we  demonstrate  the 
usefulness  of  the  piecewise-linear  quadrature  for  the  approximation  of  integrals  with  weight 
functions  of  the  form  with  ti;(t)  =  where  U(t)  is  a  positive  continuous 

function.  Such  weight  functions  arise  in  the  solution  of  CSIE’s  with  variable  coefficients 
(e.g.  Elliott  [5],  Welstead  [17]). 
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2-  PIECE  WISE-POLYNOMIAL  QUADRATURES 


We  begin  this  section  with  some  preliminary  mathematics*  We  rewrite  (l)  as 


I[g  -,e)  =  J(g  -,s)  +  ?(«)9o(*)-  A9  ,^)  =  J  ^w(<)  -  — - — dt,  ggls)  =  j  ^  W 

and  assume  that  gQ{f)  exists.  A  sufficient  condition  for  its  existence  is  that  u/{t]  is  Holder 
continuous  in  every  open  subinterval  of  [—1,  l),  which  is  true  for  the  weight  function  in 
(2).  The  function  gQ(«)  satisfies 

(jo(«)  =  -JTCot  (ffOi)  u;(a)  -  2  («+P)  F(1,-(q+/?)  ;  1-hQ  ;  — ).  ft  0,-1,...  (5) 

where  F(l,-(a+/3) ;  l+ot ;  is  the  hypergeometric  and  r(z)  the  gamma  function  (e.g. 
Tricomi  [16]).  For  several  important  special  cases  the  hypergeometric  function  may  be 
simplified  further.  For  example,  if  ft+^  =  «  and  k  is  an  integer,  then 


l(-a~-p-|-l)  2  Sinj^ra) 


(6) 


where  is  the  Gauss- Jacobi  polynomial  of  degree  k.  We  are  now  ready  to  introduce 

the  piecewise-linear  quadrature. 


2.1*  Piecewiae-Linear  Approximations 

We  subdivide  j-l,  1)  into  n  subintervals  [f^,  t.+i),  t,  <  t,+i,  »  =  0(l)n-l,  =  -1,  =  1, 
and  define  the  stepsize  h,-  =  tj+i-tj,  »  =  0(1) n~l.  The  piecewise-linear  polynomial 
approximation  g„(t)  of  g{t)  is  defined  by 

9(*t+l)-9(‘i) 

9„(0  =  «i(<-<i)  +  if  tj+il,  ai  =  - 7 - ,  6,  =  g(ti),  i  =  0(l)n-l.  (7) 

Furthermore,  we  set 


/,  =  /  w{t)dt,  I,{s)  ^  4  — ^.(*)  = 

Jt^  I  s 

for  t  ^  0(l)n-l,  and  present  the  following  theorem. 

Theorem  1:  (1)  For  «  €  [<y,  ty+il,  1,  ,  w-l},  we  have  Ay{s)Ij(s)  =  0  and 

n-l 

•f(?n : «)  =  H  { “lA  +  A(«)A(«)  }• 

t=0 

(ii)  1/  (^(t)  is  continuously  differentiable,  then  ; «)  converges  uniformly  to  J(g ;  «). 


(8) 


(9) 
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Proof:  (i)  We  first  show  that  =  0  for «  e  Since  «;(#)  is  Holder  continuous 

in  every  closed  subinterval  of  (-1,  1),  then  there  exists  an  <  >  0  so  that  the  integrals 


4  - dt  =  I  - — - dt  /  - ,  ji— l(l)n-2 

Jt-  t-6  Jt.  t-S  t-S 

f^}+lw{t)  f’+^w(t)-w{8)  f’-^w(t)  f^j+lw{t) 

-f  - dt  ~  I  - dt  +  /  - dt  +  /  - dt,  )  —  0,  fi— 1, 

Jt^  t-8  J,-c  t-s  Jt-  t~e  J,+(  t-s 

are  bounded.  Moreover,  AJ(s)  =  0,  for  *  e  J  =  0(l)n-i  and  therefore 


f‘j+i  w(t) 

=  °  for 

Jij  I  s 

Similarly,  for  s  =  -  0  and 


(10) 


/'^i+2  «;(() 

Ajls)  4  -^dt  =  0,  -  0(l)n-2. 

•'tj  *  *j  +i 

By  substituting  g^(t)  in  the  second  equation  in  (4),  by  rewriting  (4)  eis 


(11) 


•^(?n;*)  =  /  “'(0~ 

1=0 


‘t+i  <*i(*^ti)+*i  “  ajU~tj)-~bj 


t~t 


dt 


(12) 


and  by  using  (10),  (ll)  and  (12)  we  derive  (9). 


(ii)  The  proof  of  convergence  shown  by  Stewart  [15]  for  a  =  0  =  1/2^  also  applies  to  the 
weight  function  (2)  for  all  a,  <  1.  □ 

We  now  consider  the  quadrature  approximation  for  /(^;«).  We  define 


M.)  =  C 


/,(*)  if 

otherwise 


/i  +  (^"i.)M«) 

"i  - - - ’  *  =  o(i)n-i, 


(13) 


•"ot*)  =  M*)-^*)-  *"n(*)  =  ‘'n-l(«).  “'ll*)  =  •'i-l(») +  li(«)  -  M*)-  ‘  =  1(1)«-1.  (14) 

and  present  the  following  corollary,  which  can  be  shown  directly  from  Theorem  1. 


Corollary:  The  piecewise -linear  quadrature  for  I{g ; «)  is  given  by 


828 


n 


(15) 


n-] 

^(?n5«)  ~  XI  ^9o(<)  - 

1=0  i=0 

where  ?„(^),  ^o(^)  defined  in  (7).  (4)  and  (14)  respecAively.  □ 

It  is  clear  from  Theorem  1  that  the  main  difficulty  with  the  piecewise-polynomial 

quadrature  (9)  is  the  evaluation  of  /■  and  /*(«).  Several  authors  have  considered  particular 
cases  of  w(t).  Flugge-Lotz  [12],  Atkinson  [2]  and  others  (see  the  references  in  Stewart  [15] 
and  Ivanov  |10])  have  studied  the  case  u;(<)  =  1.  For  this  case,  we  see  that  the  integrals  in 
(8)  have  a  closed  form  expression,  e,g.  i,  =  ^nd  7^(^)  =  In  Closed 

form  expressions  have  also  been  derived  for  the  cases  a  =  /3  -  ±1/2  by  Gerasoulis  [7],  [8], 
Gerasoulis  and  Srivastav  [9],  Jen  and  Srivastav  [llj*  Such  closed  form  expressions  do  not 
exist  in  general  and  a  numerical  method  must  be  used*  However,  since  the  integrands  in 
(8)  are  singular  at  ±1,  classical  quadratures  are  not  appropriate  for  their  approximation.  It 
is  only  recently  that  an  attempt  was  made  to  develop  numerical  methods  for  the  evaluation 
7,  and  /,(«):  Miller  and  Keer  [13]  suggested  an  interesting  technique  for  estimating  7,  and 
7,(fl),  They  first  expand  the  weight  function  ii;(/)  -  (l-t)“*^(l±t)~^  into  a  Taylor’s  series  in 
(-1,  l)  and  then  integrate  the  series  ‘‘exactly”.  However,  since  the  Taylor’s  series  is 

unbounded  at  ±1,  the  integrated  series  converges  slowly  near  these  points* 

In  the  next  section,  we  introduce  a  simple,  efficient  and  numerical  stable  method  for  the 
evaluation  of  7,  and  7^{fi).  The  underlying  idea  in  the  evaluation  of  I-  and  I^(s)  is  similar 
to  “product  integration”  which  has  been  successfully  used  in  quadrature  approximations  of 
weakly  singular  integrals  (Atkinson  [l,  p.  272]).  Thus,  instead  of  expanding 

u;(i)  ^  in  (-1,  1)  as  it  has  been  proposed  by  Miller  and  Keer  [13],  we  first 

split  the  interval  [-1,  l]  into  two  subintervals  [-1,  0]  and  (0,  l|  and  expand  the  part  of 

M;(t)  which  consists  of  a  C“  function  and  integrate  the  other  part.  Note  that  (1-/)“^  is 
in  [-1,  0)  and  that  (l±t)“^  is  C°°  in  [0,  ij.  Clearly,  this  approach  would  be  practical 
only  if  the  integrated  part  can  be  computed  very  efficiently  to  within  a  given  tolerance. 
Fortunately,  as  we  shall  see  in  the  next  two  sections,  this  is  possible  for  the  weight 
function  defined  in  (2). 

2*2,  The  Numerical  Evaluation  of  Ij 

In  Theorems  2  and  3  we  present  a  series  approximation  for  7^  and  analyze  its  convergence. 
Note  that  each  step  in  the  series  is  computed  recursively  and  therefore  very  few 
computations  are  performed  per  summation  step. 

Theorem  2:  The  integrals  7^,  i  —  0(l)n“l,  defined  in  (8)  can  be  evaluated  from  (a),  (b) 
and  (c)  below^ 
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(a)  If  i  is  such  that  0  <  t,  <  and  p  6  [0,  li. 


A  =  (i+p)“^L  ^-1) 


r  t\k_ 


r(^)t!(l+p) 


k 


(16) 


A:(l-p) 


1 


Vi  =  o  (i?) 

(b)  If  t  is  such  that  «,  <  <0,  and  pe[-l,  Oj, 


r(a)k!(l-p) 


k  '•-* 


(18) 


ib  f  1  j  B  ^  j 

(l  +  ‘t-n)^“^(<.  +  l-p)*  -  (l  +  ‘,)^“^(t.-p)*}.  A  -1  =  0-  (19) 

(c)  If  i  is  such  that  <  0  <  then  this  interval  is  split  into  two  subintervals  ]/,,  0] 
and  [0,  and  1^  is  evaluated  by  using  (a)  and  (b)  in  each  subintervaL 


Proof  :  We  prove  only  part  (a)  since  the  proof  for  part  (b)  and  (c)  is  similar.  The 
Taylor’s  series  expansion  of  is  given  by 


si'-'’! 


(1+0-^  =  (i+pi-'’f; 

r(/3)*!(i+p)* 

which  converges  for  all  points  p  €  [O,  Ij.  By  multiplying  with  and  by  integrating 

the  last  equation,  we  derive  (16),  where 


f 

Jt, 


i+i  (<-p)* 


(i-tr 


dt. 


(21) 


To  show  (17),  we  consider  the  indefinite  integral  A,Jfe  integrate  it  by  parts 


=  1^.'“  =  (‘-P)*!*-')'-”  -  ‘/ 

which  reduces  to  the  following  remarkable  recursion 


i(i-p)-(f-p)i(t-p)*' 


-dt} 


(22) 


/  _  p  (l-t)^-«(<-p)* 

•’*  1+ifc-a  •’*'“*  1+ib-o 

Then,  (17)  is  derived  by  taking  the  integration  limits  in  (23). 
then 


(23) 

Furthermore,  since  a,  /0<1, 
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^  -  l^j^a  -  -  l  +  k-P  -  ^1’ 

which  imply  that  the  recursions  (17)  and  (19)  are  numerically  stable  [3,  p,  17].  □ 

We  now  consider  the  rate  of  convergence  of  the  partial  sums  where 

r  =  h..  4 "  =  i: 

k=o  mk\{i+p)>^^^ 

Theorem  3:  If  p  =  t-  or  p  =  then  the  Temainder  Rj^  of  the  series  in  (16)  satisfies: 


'"'•I  i  ITT  s— TT 


^m\  1-0 

- ,  for  all  N  >M=  j - ^1, 


where  A  =  —  <  1, 
i+p 

Proof  :  We  will  show  that  \di^\  <  A|(i^_2|  and  A  <  1  for  all  k  >M,  Since  0  —  0,  -1, 
implies  —  0,  for  all  Jfe  >  -/J,  we  need  to  consider  only  the  case  0  0,-1,  We 

see  that  the  assumption  p  =  or  p  =  i-  and  (21)  imply 

\k—\^0\h^ 

IA,/tl  <{ti+i~ti)\kk-l\’  and  from  (24)  (26) 

1-0 

Furthermore,  since  0  <  1  and  0  <  p  <  1  then  |ifc“l+^|  <  A:  for  all  >  JVf  =  [^^1?  and  by 

h, 

using  (26),  141  <  where  A  =  <  1*  From  the  last  equation  in  (24) 

“  Kw+ll  Mdsl 

i«nI  <  i<i*i  <  Z  =  -jzr  -  TT-  ^  ^  M 

Jfc=7V+l  fc=N+l 

where  we  have  used  the  sum  of  the  the  geometric  series  [3,  p.  63].  □ 


This  Theorem  implies  that  the  partial  sums  converge  very  fast  to  7^.  For  example,  if 

0  -  1/2  and  h-  =  .2,  i.e.  n  =  10  subintervals,  then  \Rjsj\  <  4x10'^  for  ^  <  9,  while  if 
-  ,02,  i.e,  n  =  100  subintervals,  then  |72^l  <  3x10“®  for  jV  <  3.  The  bound  for  the 
remainder  of  the  series  in  (18)  is  similar  to  (25). 


2,3.  The  Numerical  Evaluation  of  1^(6) 

The  piecewise-linear  quadrature  derived  in  Theorem  1,  also  requires  the  evaluation  of  7,(«), 
*  =  0(l)n-l,  for  In  the  next  two  Theorems,  we  present  a  series  approximation 

for  7^(^)  and  analyze  its  convergence.  Note  again  that  each  step  in  the  series  is  computed 
recursively  and  therefore  very  few  computations  are  performed  per  summation  step  (i.e.  9 

multiplications  and  5  additions  per  step). 
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Theorem  4:  The  integrals  /,(«),  i  =  0(l)ri  1  defined  tn  (8)^  can  he  evaluated  for 
« 0  (<ij  from  (a)j  (b)  and  (c)  below.  where  the  point  p  is  chosen  as 


P  = 


c 


t,  if  s  > 


‘,+1  if  «  <  h 

(a)  If  i  is  such  that  0  <  <,  <  then 


(«-p) 


(27) 


k^O 


where  /j  is  defined  in  (17)  and  has  a  closed  form  expression  for 

=  0,  -1:  -2,  e.g,  if  f3  =  0  then  =  l/(p-fi),  while  for  ^  0,  --1,  -2, 


,  ,  ,  ,  (A:“l-f/3)  6—p  (l-hp)^^ 


k  1+p 


p-6 


(28) 


B_^(s)  =  0* 


If  numerical  underflow  occurs  in  the  evaluation  of  by  (27)  then  the  following 

equivalent  recursion  should  be  used: 


If  <  1  ,  then  set  _i(«)  =  0  and  evaluate 

1 


k  1—p 
-P 


for  k  =  0,  1,  If  I — ^  >  1  then  choose  a  sufficiently  large  set  /,  ;v(^)  ^  ^ 

evaluate  ^  ~  ^^ing  (29)  backwards. 

(b)  If  i  is  such  that  <  0,  then  I-{6)  is  evaluated  from  (27) ^  where  ^  is 

defined  in  (19)  and  Bf^(6)  has  a  closed  form  expression  for  a  —  0,  —1,  —2,  while  for 
a  ^  0,  ”1,  “2, 


/V  .V  .  ^  (k—l-\-a)  s—p  {1“P) 


-p'  -  -  '■  ^  p-s 

B^iie)  =  0.  Similarly,  if  <  1,  then  set  =  0  and  evaluate 

k  l^p  1 


(30) 


for  it  —  0,  1,  while  if 


U+p| 

|«-p| 


>  1  then  use  (31)  backwards. 


(c)  If  i  is  such  that  <  0  then  this  interval  is  split  into  two  subintervals  (i^-,  O] 

and  [0,  and  /,(fi)  is  evaluated  by  using  (a)  and  (b)  in  each  subinterval. 


832 


Proof:  We  prove  only  part  (a)  since  the  proof  for  (b)  and  (c)  is  similar.  From  the 
definition  of  p  and  the  fact  that  \^-p .  for  <  t  and 


1 


{ 


P-S  l-{t-p)/(6-p) 

By  combining  (20)  and  (32)  we  obtain 


—}  =  —E  (—)''■ 

—  nl  p^s  ^  n 

k=0 


(32) 


k^o 


s-p 


^  '  ’  r(;3)jl  ' 


(33) 


which  reduces  to  (27)  by  multiplying  with  (1-t)  ^  and  by  integrating,  where 


A-,*(0  =  (34) 

Ji-  s  p 

The  recursion  in  (28)  can  be  easily  deduced  from  the  definition  of  Bi^(s)  in  (33).  Moreover, 
since  <  1  for  p  €  (0,  1]  and  s  €  (-1,  1),  this  recursion  is  numerically  stable. 

Equation  (29)  may  be  derived  by  dividing  (17)  by  If  r'^<l  <^hen  (29)  is 

p\ 

numerically  stable.  If  ^  >  1  then  (29)  is  unstable,  but  its  equivalent  backward  recursion  is 
stable.  To  determine  a  starting  point  N  for  which  the  backward  recursion  computes 
k  =  (N-1){1)0  with  an  error  less  than  e,  we  use  (34)  to  derive 


< 


|N 


\N 


|/^ol  < 


implies  N  > 


ln(^/IA,ol) 


^2  - 


(35) 


\s-pr 

where  1/^q|  is  defined  in  (17)  and  By  setting  f,;v(^)  “  ^5  then  the  error  e  will 

be  multiplied  by  <  1  in  each  recursion  step  and  therefore  all  k  =  (iV-l)(l)0,  will 

be  computed  with  an  error  less  than  e,  provided  that  the  maximum  error  in  floating  point 
arithmetic  is  less  than  c  [3,  p.  17],  □ 


We  now  consider  the  rate  of  convergence  of  the  series  in  (27).  We  define 
N 

=  E  «Ni*)  =  A(')  -  =  E  (^®) 

*=0  k^N+1 

Theorem  5:  The  remainder  i2;v(«)  of  the  series  in  (27)  satisfies: 

|i«;v(*)|  <  C(*)A^^'|/.,ol  for  all  N  >  M  =  f^l.  (37) 

where  Aj  <  1  is  defined  in  (35)  and  C(s)  is  independent  of  N. 

Proof:  We  can  easily  see  that  the  choice  of  M  in  (37)  and  /3  <  1,  implies  |y— H-;3|  <  j  for 
all  j  >M.  Thus,  from  the  definition  of  bj(s)  in  (28)  we  derive 
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i6j(<)i  <  y  =  L,  i+1,  for  all  L  >  M 


(38) 


where  ^  ~  <  1  for  p  €  [0,  1]  and  1).  The  second  equation  in  (33)  implies 


i+P 


=  B^(s)  -  ^  fc^.(«).  B„(*)  == 


(1+*) 


-0 


j=k-\-l 


p-8 


and  by  using  the  sum  of  the  geometric  series  and  (38),  we  derive 


ms)\  <  |B^(*)|  + 


l-A. 


k  >M. 


(39) 


(40) 


Furthermore,  (34)  implies 


iA,*(*)l  ^  -^2  ^iA,i(«)l.  k  =  L,  L+1,...,  and  X  >  0,  (41) 

and  since  ^  I  6-p\  from  the  assumptions  of  Theorem  4,  then  A2  <  1.  By 

substituting  (41)  and  (40)  into  the  last  equation  in  (36)  and  by  using  (38)  we  derive 


00 


fc-jvl^JV4-l(*)l  t-JV-l 
1  (l-A,)  2 


< 


(42) 


|5co(*)l  \f>N+l(^)\ 

^  (I-A2)  +  (l-A)Ai(l-A,)  ^ 

where  A  =  A,  A2  =  is  also  defined  in  Theorem  3.  By  using  (41)  the  last  inequality 

reduces  to  (37),  where,  because  of  (38),  we  may  choose 


C'(.)  =  { 


l^oo(«)l 

(1-A2) 


+ 


IM*)I 

(l-A)(l-A,)^-  ° 


We  see  that  the  speed  of  convergence  of  i2;v^(«)  depends  on  A2  <  1*  If  A2  is  close  to  1,  then 

will  converge  slowly.  Since  slow  convergence  may  be  encountered  in  the 

intervals  and  particularly  if  s  is  near  /y  or  ty^j.  To  avoid  this,  we 

select  «  to  be  a  node  point.  Since  i4y(fl)/y(«)  =  0  and  ^y_i(«)/y_i(«)  =  0  if  £  =  ty,  then  we 
do  not  have  to  evaluate  /y_](«)  and  /y(*)  in  these  intervals.  Furthermore,  if  we  assume  that 

the  mesh  is  uniform,  then  we  can  easily  see  that  A2  =  1/2,  1/3,  ...  in 

(iy_2,  ^j-2h  -1  respectively.  Note  that  (l/2)^®  ^’3x10“®,  (l/3)^®  =  7x10“®, 

(1/4)^^  GxlO"®,  which  imply  that  only  few  subintervals  near  [ty,  fy^j]  will  need  iV  >  10  for 
i2^(fi)  to  become  less  than  (7(fi)  10“®. 
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3,  NUMERICAL  APPLICATIONS 


The  algorithm  described  by  Theorems  1,  2  and  4  is  simple  and  computationally  efficient. 
Note  that  the  numerical  computation  of  and  I^{s}  requires  the  evaluation  of  a  single 
summation  as  opposed  to  three  or  four  nested  summations  needed  by  Miller  and  Keer’s  fl3] 
algorithm.  Moreover,  for  a  uniform  mesh,  the  total  amount  of  computations  needed  for  the 
evaluation  of  /*  and  f^fs)  may  be  cut  in  half  by  using  the  identity  _  f^t+2 

w'here  it  is  assumed  that  the  and  I-(s)  are  saved  (stored)  when  the  mesh  increases  from 
n+1  to  2n-r  1  points.  We  now  present  two  examples  which  illustrate  the  advantages  of  the 
piecewise^polynomial  quadratures  *  versus  the  Gauss-Jacobi  quadratures,  particularly  for 
“badly”  behaved  integrands. 

Example  1:  The  piecewise-polynomial  quadrature  can  be  used  to  approximate  weakly 
singular  integrals  of  the  form: 


rl  ^ 

J  ^  f  w{t)g{t)dt  «  g(t,),  w{t)  = 

i=0 


;(1) 

“^0  “  ^0  “  T~ ’ 
"0 


‘n-1 


Wj  =  /,  -  —  +  7 - ,  »■  =  l(l)(fi-l), 


(43) 


(44) 


where  /,■  is  defined  in  Theorem  1  and  ^(i)  (<-r,)4i.  The  series  expansion  for 

may  be  obtained  from  (16)  and  (18)  by  replacing  with 


Extensions  to  higher  order  piecewise-polynomial  approximations,  i.e,  quadratic,  cubic,  etc.,  is 
straightforward.  The  weights  for  piecewise-polynomial  methods  of  order  m  >  1  may  be 

obtained  as  linear  combinations  of  w^(0  (t~  t^ydt^  j  =  0(l)m,  The  integrals  1^^ 

may  be  estimated  by  replacing  with  /,  jt+j  (1^)  Since  used  in  the 

evaluation  of  all  quadrature  weights,  very  few  additional  computations  are  needed  for  the 
estimation  of  the  weights  of  higher  order  methods. 


In  Table  3-1  we  present  the  numerical  results  of  the  quadrature  in  (43)  for 
9(t)  =  \/itj,  u;(t)  =  (1-t^)  and  for  two  different  choices  of  mesh  points.  The  decay 
exponent  and  the  order  of  convergence  of  the  Error  =  e„  =  |J— J„|  are  defined  by 
p  =  In  (e^/e2„)/ln  (2)  and  0(n~^]  respectively.  The  decay  exponent  for  the  uniform  mesh 
=  ~1  +  ih,  i  —  0(l)n,  h  =  2/n  tends  to  p  =  1,50,  which  is  expected  for  the  square  root 
integrand  (Atkinson  [l,  p.  255]).  The  global  Gauss-Chebyshev  quadrature,  e.g. 

Jg  =  <,  =  cos  ((2t-l)>r/(2n)),  i  =  l(l)n,  also  converges  with  p  =  1.50.  The 

accuracy  of  the  Gauss-Chebyshev  quadrature  is  similar  to  the  piecewise-linear  quadrature 
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with  uniforrrj  mesh,  i.e,  if  n  —  80,  =  2.39723.  The  value  is  computed  from  the  exact 

j  =  4v'^r(3/4)/r(i/4). 

It  is  well  known  that  a  better  rate  of  convergence  may  be  attained  if  the  mesh 
concentrates  at  the  points  for  which  the  input  functions  or  their  derivatives  are  singular. 
For  the  "\/j7|  whose  derivative  is  singular  at  /  =  0,  the  nonuniform  mesh 
tj  -  i  =  0(l)(n/2),  and  t-  —  i  =  (r?/2)(l)n,  results  in  an  O(h^)  rate 

of  convergence  (column  7  of  Table  3-1),  The  0{h^)  rate  is  expected  for  this  choice  of 
nonuniform  mesh  (De  Boor  [4,  p.  46]).  However,  a  similar  improvement  on  the  rate  of 
convergence  of  the  Gauss-Chebyshev  method  is  not  possible  since  are  predetermined.  For 
piecewise-polynomial  quadratures,  an  additional  extrapolation  may  be  used  to  further 
improve  the  numerical  results.  This  is  demonstrated  in  the  last  column  of  Table  3-1, 
where  Extrap  —  (4  J2n  ~  Jn  )/S,  It  can  be  shown  that  the  order  of  convergence  for  the 
piecewise-linear  quadrature  in  (43)  is  O(h^)  for  functions  (Atkinson  [l,  p.  273]). 


Uniform  Mesh 


Nonuniform  Mesh 


n 

1 

1 

1  Error 

1 

P 

II  Jn 

II 

1  Error 

1  _  ^ 

P 

Extrap 

10 

1 

1 

2.36119 

1 

1  3.5x10“^ 

II 

li  2.36147 

1 

1  3.4x10“® 

20 

1 

2.38366 

1  1.3x10“2 

1.47 

II  2.38703 

1  9.3x10“® 

1.91 

2.39555 

40 

1 

2.39176 

1  4.5x10“® 

1.48 

II  2.39388 

1  2.4x10“® 

1.94 

2.39616 

80 

1 

2-39467 

1  1.6x10“® 

1.49 

II  2.39566 

1  6.2x10“^ 

1.96 

2.39626 

oo 

1 

2.39628 

1 

1.50 

1  2.39628 

1 

2.00 

2.39628 

_1 

J _ 1 _ 

_ 

J _ 1 

Table  3-1:  The  quadrature  (43)  for  g(<)  —  y/\t\  and  a  =  /?  =  1/2 

Example  2:  We  will  now  apply  the  piecewise-polynomial  quadrature  to  the  solution  of  a 
CSIE  with  variable  coefficients.  Welstead  [17,  p.  103]  has  solved  the  CSIE 

TTf 

j  COS  (yjw'lO  y(*) 

sin  (^)  tt)(«)y(«)  +  “-/  - — - dt  =  /(«)  (45) 

w(t)  =  ^  g  (46) 

by  using  orthogonal  polynomial  approximations.  Equation  (45)  is  equivalent  to 
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y(t)  =  sin(y)/(t)  -  J{F-,t)  - 
nt 

cos  (Y)/(t) 

J(F',t)  = 

e  7rO(t) 


)  -  m  j  -— 

J -l  s—t 


/> 


F{s)-F(t) 


l'±l, 

*/  - 1  6  —  t 


-ds  —  —  TTt 


then  the  piecewise-polynomial  quadrature  can  be  used  to  approximate  J(F;t). 


In  Table  3-2  we  present  the  approximation  y„{l)  of  y(l)  by  the  piecewise-linear  quadrature 

(15)  for  f{t)  =  Vl-t^  and  C  -  1/n,  We  again  select  a  uniform  mesh 

=  -1  -h  t /i,  i  -  0(l)n,  h  =  2/n  and  a  nonuniform  mesh  t,  =  -i  -h  i  =  0(l)(n/2), 

i  —  (n/2)(l)n  which  concentrates  at  il.  Note  that  the  decay  exponent  p  tends 
to  2.00  for  both  choices  of  mesh  points.  This  is  expected,  since 

Fit)  =  A/(e»r)  (50) 

A 

is  a  C^[-l,  1)  function  and  its  second  derivative  is  integrable.  More  specifically,  we  can 
show  that  f’(±l)  =  0,  f  '(±l)  =  Tl/e,  =  $i{t)ln  (1-t)  +  (1+*)  +  where 

*  =  1(1)3  continuous  functions.  If  F„(t)  is  the  piecewise-polynomial  approximation 
of  Fit)  defined  hy  (7),  then,  hy  using  Peano’s  error  formula,  we  easily  see  that 


|J(/’-F„;l)|  <  jf\l+t)^/^ll-t)-^/^lF"lt)ldt  (51) 

where  h  -  maxh^  for  all  Since  |ln{l±t)|  <  c(#)(l±t)“^  for  any  >  0,  where  c(#)  is  a 

positive  constant,  the  integral  in  the  last  equation  is  bounded  and  the  0{h^)  order  of 

convergence  is  obvious. 

We  observe  that  the  Ettot  in  the  uniform  mesh  part  of  the  Table  is  about  half  the  Error 

of  the  nonuniform  mesh,  which  is  explained  by  (51)  and  the  fact  that  max  /i,  is  larger  for 

the  nonuniform  mesh.  However,  the  decay  exponent  p  for  the  nonuniform  mesh  tends  faster 
to  2.00  and  consequently,  the  error  for  the  nonuniform  extrapolation  is  smaller  than  that  of 
the  uniform  extrapolation  for  the  same  n.  The  y^(l)  =  0,518592  is  computed  by  choosing 
n  =  2560. 


837 


The  numerical  solution  derived  by  Welstead  [17].  with  the  use  of  orthogonal  polynomials 
with  respect  to  uj(i)  in  (45),  is  yj\)  ^  0.518583  for  n  =  40.  The  generation  of  such 
polynomials,  mesh  points  and  quadrature  weights  for  nonclassical  weight  functions  require 
considerable  computational  effort  since  a  Stieltjes  procedure  to  determine  the  recurrence 
coefficients,  Fejer’s  quadrature  to  evaluate  the  integrals  involved  and  the  solution  of  an 
eigenvalue  problem  via  Lanczos’  algorithm  would  have  to  be  implemented  (e.g.  Welstead 
[17,  p.  115])-  Therefore,  it  is  obvious  that  the  computational  effort  will  increase  with 
the  dimension  n  of  the  approximation.  It  is  not  clear  whether  this  approach  is 
computationally  feasible  for  large  n’s.  However,  the  computational  effort  for  the  piecewise- 
polynomial  method  increases  slowly  as  n  increases,  since  the  series  for  all  additional 
integrals  converge  fast  (see  Theorems  3  and  5).  Using,  the  piecewise-linear  method,  we 
have  routinely  solved  problems  with  n  =  5000  without  encountering  any  numerical 
instability. 


Uniform  Mesh 


Nonuniform  Mesh 


n 

1  »n(l) 

J _ 

1  Error 

1 

P 

II 

1 . 

1  Error 

P 

Extrap 

10 

1 

1  0.514213 

4.3x10“^ 

II 

II  0.508712 

1  9.9x10“® 

20 

1  0.517349 

1  1.2x10“® 

1.82 

II  0.515989 

1  2.6x10“® 

1.92 

0.518414 

40 

1  0.518251 

1  S.lxlO"** 

1.87 

1  0.517931 

1  6.6x10“'^ 

1.98 

0.518579 

80 

1  0.518501 

1  9.0x10“® 

1.91 

II  0.518426 

1  l.TxlO”"* 

2.00 

0.518592 

oo 

1  0.518592 

2.00 

II  0.518592 

2.00 

0.518591 

Table  3-2:  The  quadrature  (15)  with  a  —  p  —  -1/2  and  defined  in  (50). 


Note  :  All  computations  were  performed  on  a  DECSYSTEM/2060T  using  FORTRAN  77 
with  double  precision  floating  point  arithmetic  (with  a  mantissa  of  16  to  18  decimals  and 
with  an  exponent  in  the  range  0.14x10^^®  to  3.4x10®®). 
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NUMERICAL  SOLUTION  OF  RANDOM  LINEAR 
VOLTERRA  INTEGRAL  EQUATION* 

M.  Sambandham 
Department  of  Mathematics 
Atlanta  University/Morehouse  College 
Atlanta,  GA  30314 


ABSTRACT .  Numerical  solutions  are  obtained  to  Volterra 
integral  equations  with  random  nonhomogeneous  terms.  The 
method  we  use  is  Lobatto  quadrature  formula.  Based  on  the 
simulation  of  random  forcing  term  the  numerical  solutions  are 
used  (i)  to  compare  the  convergence  of  the  average  of  the 
random  solutions  to  the  solution  of  the  average  equation, 

(ii)  to  evaluate  the  confinement  probability  and  risk  func¬ 
tionals,  and  (iii)  to  discuss  the  convergence  of  the  solution 
processes  of  the  random  equations  to  the  solution  of  the 
deterministic  equation  through  the  sample  path  graphs,  as  the 
variance  of  the  random  forcing  term  tends  to  zero. 

I .  INTRODUCTION .  An  integral  equation  of  the  form 

X 

y(x)  =  g(x)  +  /  K(x,t)  y(t)dt  (1) 

^0 

is  said  to  be  a  Volterra  integral  equation  of  the  second  kind 
in  which  K(x,y)  is  the  kernel  and  g(x)  is  the  nonhomogeneous 
term  (or  forcing  terra) .  Probabilistic  analog  of  (1) ,  namely, 
a  random  Volterra  integral  equation  is  defined  as  follows: 

X 

y(x,aj)  =g(x,a))  +  /  K(x,s,oo)y(s,w)ds,  (2) 

^0 

where  K(x,y,w)  is  the  random  kernel  and  g(x,aj)  is  the  random 
forcing  term.  The  parameter  ui  is  an  element  of  a  given  prob¬ 
ability  measure  space  (fl,A,ij). 

In  this  article  we  obtain  the  numerical  solution  of  (2). 
We  take  the  available  deterministic  methods  of  solving  a 
Volterra  integral  equation  and  suitably  adopt  these  methods 
to  generate  the  sample  solutions  of  the  random  Volterra 
integral  equation  (2) . 


*Research  supported  by  ARO  Grant  No.  DAAG29-85-G0109 . 
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For  related  work  on  random  Fredholm  equation  of  second 
kind  see  [3,9].  In  [3]  random  Fredholm  equation  is  considered 
with  a  random  forcing  terra  or  a  random  kernel.  in  [9]  a 
singular  integral  equation  with  random  forcing  term  is  con¬ 
sidered.  Lax  [8]  has  used  the  method  of  moments  to  obtain 
the  mean  and  the  autocorrelation  of  random  Volterra  integral 
equation.  Tsokos  and  Padgett  [10]  have  employed  the  method 
of  successive  approximation  to  solve  random  nonlinear  Volterra 
integral  equations.  For  the  several  discussion  and  methods 
of  numerical  solution  of  Volterra  integral  equations  refer 
to  Baker  [1],  Golberg  [4].  Other  important  numerical  method 
is  Lobatta  Method  (Jain  and  Sharma  [7]). 

We  organize  our  article  as  follows.  ’  In  Section  2  we 
present  a  short  review  of  the  Lobatto  method  [7].  In  Section 
3  we  present  the  numerical  technique  we  employed  to  solve  the 
random  Volterra  integration  equation.  The  concept  of  confine¬ 
ment  probability  and  risk  functionals  are  discussed  in  Section 
4 .  Some  numerical  results  and  figures  are  presented  in 
Section  5  and  a  short  discussion  and  conclusion  are  in  Section 
6 . 


II.  LOBATTO  QUADRATURE  FOR^IULA.  Consider  the  Volterra 
linear  integral  equation  of  second  kind; 


y(x)  =  g(x)  +  /  K  (x,  t)  y  (t)  dt . 


(3) 


In  (3)  y(x)  is  numerically  to  be  determined  for  a  given  con¬ 
tinuous  forcing  function  g(x)  and  jointly  continuous  kernel 
K(x,t) ,  Equation  (3)  gives  the  values  of  y(x)  at  x  =  x^  + nh 


as 


n 

y(x  )  =  g(x  )  +  /  K(x  ,t)y(t)dt 

n  n  n 

^0 

n-1  ^p+1 

=  g(x  )  +  Z  /  K(x  ,t)y(t)dt. 

n  »  n 

p=0  X 

^  P 

Now  approximating  the  integral  in  (4)  by  the  following 
quadrature  formula 

='0**'  .  4 

/  g(x)dx  =  Z 
Xq  P=1 


(4) 


(5) 


where 
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=  1/6,  W2  *  «  5/6 


'1  "4 

^1  "  ^0' 


T2  =  Xq  +  rh,  r  =  (5  -  /5)/10 
=  Xg  +  sh,  s  =  (5  +  /5)/10 


T4  -  Xq  +  h, 

3.2^  •  15750 


f  ^0 


^  t  <  x.+h. 


the  approximate  value  of  y(x^)  is  given  by 


y(x„)  =  [12g(x^)  +  h{K(x^,x^.^)y(x^_^) 


+  5K (x  , X  , ) y (x  , ) } 
n'  n+s-1  ^  '  n+s-il 


n-2 

+  h  Z  {K(x^,x^)y(x  )■ 

p=0 


n  p 


+  5K  (x  ,  X  )  y  (x  .  ) 
n  p+r  ^  p+r 


+  5K(x  ,x  )y(x  ) 
n  p+s  p+s 


where 


y'Vr' 


.  Th 


62p' 

+  '<“2p8op  -  '‘0p«2p>y<^p>  "•  '^2p^<Vs’ 


■  ®2p5' Vr*'  ®lp  ' 

•  ‘'“lp®0p  -  “opV’=^*’'p’  *“lp5'Vs’ 
-  6lp?'Vr'’‘^  82P  - 


(6) 
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y(x  .  )  -  y(x  )  (ha 


p+s 


A/5 


-  -  V  ) 


Ip  "2p 


^  ><“2p®0p-“0p82p>y'^p>  *  “2p9<Vs’ 

■  ®2p9‘Vr”'^  “ip  ■ 

-  "“ip^Op  -  Veip)y(Xp)  +  a^pg(Xp^^) 

3 

-  8lpy‘Vr>  “2p  -  rrVl/A, 


p+r 

“np  "  /  K(Xp^^,x)  (x-Xp)’^dx,  n  =  0,1,2, 


X 


0 


p+s 
/ 

X, 


^np  "  /  K(Xp^g,x)  (x-Xp)''dx,  n  =  0,1,2, 


^  “lp^2p  "  “2p®lp  ' 


A  =  Y[h(a^pS  h  -  h  +  r3,„  -  sa^^)  -  Jl_  -  y] . 


Ip 


2p  •"-2p^ 


5/5 


This  equation  (6)  describes  a  single-step  method  for  the  solu¬ 
tion  of  (3)  . 


III.  RANDOM  VOLTE RRA  INTEGRAL  EQUATION  AND  THE  NUMERICAL 
METHOD^ The  equation  that  we  are  interested  in  is  the  fol- 
lowing  equation: 


y(x,aj)  =  g(x,oj) 


X 

/  K  (x,  s ,  u))  y  (s ,  u))  ds 


(7) 


where  g(x,(ij)  is  the  random  forcing  function  to  be  simulated. 
For  each  sample  function  g(x,u)  using  Lobbato  method  equation 
(7)  is  solved.  For  our  numerical  work  we  consider  the  fol¬ 
lowing  two  examples : 


2 

Example  1:  g^(x,'ju)  e  N(x,a  x)  , 

Kj^(x,s,a))  =  s  -  X  t 

=  constant. 
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2 

Example  2;  g2(x,a))  e  N(x+1,  o  x) 

(x,s,a))  =  X  -  s  j 

2  -  n 

a  “  constant^  Xq  -  u. 

We  notice  that  in  Examples  1  and  2  Egj^(x,(i))  =  x  and 

E(g2(x,w)  )  =  x+1.  These  factors  slx>w  that  the  solutions  of  the 

average  of  equation  (7)  for  Examples  1  and  2  are  respectively 

sin  X  and  e  .  Further  we  note  that  if  a  =  0  then  the  random 
equation  (7)  becomes  deterministic.  For  our  numerical  cal¬ 
culations  we  take  a  values  tend  to  zero  so  that  we  get  the 
deterministic  solution  at  o  =  0.  This  factor  is  an  addi¬ 
tional  advantage  that  we  get  the  deterministic  solution  as 
a  particular  case  of  our  numerical  example.  The  graphs  and 
numerical  data  for  different  a  are  presented  in  Section  5. 

IV.  CONFINEMENT  PROBABILITY  AND  RISK  FUNCTIONAL.  The 
concept  of  confinement  probability  and  risk  functional  are 
useful  techniques  in  the  probabilistic  numerical  analysis. 

To  be  more  specific  these  are  useful  in  the  analysis  of 
stability  of  continuous  random  systems  with  nonstationary 
responses."  The  confinement  probability  can  be  considered  to 
be  a  real  valued  functional  of  random  function.  For  any 
positive  constant  the  confinement  probability  function 
F  for  any  random  solution  y(x)  is  defined  by 

Ot 


=  P(|y(x)  |)<  a,  for  all  x  e  [0,t])  (8) 


Those  systems  for  which  the  confinement  probability  decays  to 
zero  slowly  are  considered  to  be  more  stable  than  those  for 
which  the  confinement  probability  approaches  to  zero  more 
rapidly  as  the  system  evolves  in  time. 

The  concept  of  risk  functional  is  useful  to  check  whether 
the  system  exceeds  certain  bounds.  Suppose  there  is  certain 
risk  if  the  absolute  value  of  y  exceeds  certain 
constant  y.  Then  the  risk  functional  G  is  defined  by  the 
relation  ^ 


yU)  s  -yl  u  t  inax  y(C)  2  yH.  (9) 

'  0^5<x  0<5<x 

Confinement  probability  and  risk  functional  are  useful 
techniques  one  can  implement  and  analyze  during  the  numerical 
procedure  of  probabilistic  numerical  analysis.  These  concepts 
can  be  used  to  check  the  stability  and  error  analysis  during 
the  numerical  experiments.  In  the  next  section,  we  discuss 
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these  concepts  for  Examples  1  and  2.  For  more  details  on 
confinement  probability  and  risk  functionals  see  [5,6]. 

V.  NUMERICAL  RESULTS.  From  IMSL  subroutine  we  generate 
standard  normal  random  variables.  Let  it  be  R(u))  .  Then  for 
a  >  0,  let 

(i)  g^(x,u)  =  X  +  a*/x  R(aj)  , 

(ii)  g2(x,a))  =  X  +  1  +  a/x  R(aj)  . 

We  remark  that  E(gj^(x,(o))  =  x  and  E(g2(x,cio))  =  x+1  and 

Var  (g^  (x,  (jj)  )  =  a^x,  i  =  1,2.  We  notice  that  if  a  =  0,  gj^(x,u) 

and  g2{x,a3)  are  deterministic  functions.  We  simulate  sample 

functions  for  g^(x,(i))  and  g2(x,(i))  and  use  Lobatto  method  to 

solve  Examples  1  and  2.  Our  numerical  results  are  based  on 
40  samples.  For  Example  1  we  analyze  when  x  e  [0,7t]  and 

for  Example  2  we  study  for  x  e  [0,3].  For  the  numerical 

studies  we  took  a  =  1.0,  0.5,  0.1,  0.05,  0.01,  0.  For  these 
six  values  of  a,  we  present  the  sample  path  graphs  for 
Examples  1  and  2  in  the  Figures  1-6  and  7-12  respectively. 
These  graphs  illustrate  the  convergence  of  the  sample  paths 
to  the  exact  values  (a  =  0) .  Table  I  and  II  represent  the 

confinement  probability  and  risk  functional  values  for 
a  =  1.0,  a  =  0.1  respectively  for  the  Examples  1  and  2. 

Table  I . 


y(x)  =  sin  x,  0  ^  sin  x  i  1 


Confinement  Probability 

Risk  Functional 

X 

a  =  1.0 

a  =  0.1 

a  =  1.0 

a  =  0.1 

(0, 

.2) 

0.710 

1.000 

0.290 

0.000 

(0, 

.4) 

0.675 

1.000 

0.325 

0.000 

(0, 

.6) 

0.640 

1.000 

0.360 

0.000 

(0, 

.8) 

0.610 

1.000 

0.389 

0.000 

(0, 

1.0) 

0.585 

1.000 

0.415 

0.000 

(0, 

1.2) 

0.567 

0.984 

0.433 

0.016 

(0, 

1.4) 

0.551 

0.944 

0.449 

0.056 

(0, 

tt) 

0.528 

0.880 

0.472 

0.120 

S46 


Fig  1  ;  (S'*  1.0  (Exanple  1) 


Fig  2  :  O' =  0.5  (  Example  1) 
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Trajectory 


Fig  3 


o' =  0,1  (Example  1) 


Trajectory 


Fig  4  o' =  0.05  (  Example  1) 
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Fig  7;  d"  =  1.0  (Example  2) 
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Table  II. 


,  X  X  ,  X  3 

y(x)  =  e  ,  1  <  e  <  e 


Confinement  Probability 

Risk  Functionals 

X 

a  =  1.0 

a  =  0.1 

1 

0  =  1.0 

a  =  0.1 

(0, 

.5) 

0.738 

1.000 

0.262  I 

0.000 

(0, 

1.0) 

0.800 

1.000 

0.200 

0.000 

(0, 

1.5) 

0.837 

1.000 

■  0.163 

0.000 

(0, 

2'.0) 

0.865 

1.000 

0.135 

0.000 

(0, 

2.5) 

0.887 

1.000 

0.113 

0.000 

(0, 

3.0) 

0.894 

0.9901 

0.106 

0.099 

VI.  DISCUSSION.  Figures  1-6  and  7-12  illustrate  the 
convergence  of  the  exact  value.  These  figures  clearly  illus¬ 
trate  the  random  disturbances  at  each  stage.  Further  the 
importance  of  the  probabilistic  models  and  determinstic 
models  are  clearly  visualized  from  the  graphs. 

The  discussion . in  Section  4  illustrate  that  confinement 
probability  and  risk  functionals  are  useful  techniques  one  can 
use  in  the  probabilistic  numerical  analysis.  The  data  in 
Tables  I  and  II  confirm  that  the  random  solution  of  the  random 
equation  converges  faster  to  the  solution  y(x)  =  sin  x  of  the 
corresponding  deterministic  equation  in  the  case  of  a  =  0.1 
corrpared  to  the  case  of  a  =  1.0.  For  the  same  parameter 
values  a  =  0.1  path  converges  in  the  second  example 

(y(x)  =s  e  )  even  faster  than  the  case  of  Example  1  (y  =  sin  x)  . 
Further  the  risk  functionals  illustrate  the  probability  that 
the  solution  exceeded  certain  limits .  Though  these  factors 
are  utilized  here  to  discuss  the  solution  processes,  one  can 
suitably  use  these  factors  for  analysis  and  other  topics  of 
consideration  in  the  probabilistic  numerical  analysis. 
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Academic  Press,'  New  York,  1974 . 
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OCM^IS  Otl  FIKITO  EC^MEHr  MCXllOD  AND  BANO-WlDiU  KEUUCriON 
WllU  HEFCBENCE  TO  TBANSIENF  HEAT  OQNDUCriCtl 


R.  Yalamanchill 

Technology  Branch,  Arniantenb  Division 
Fire  Obnbrol  and  Snail  Cciliber  Weapon  Sys bents  Laborabory 
US  Arnty  ARDC,  AMOOCM,  Dover,  HJ  07B01-5001 


ABSTRACT,  Trans ienb  bwo-diiuenslonal  heab  oonducbibn  was  analyzed  by 
£inibe*^lemei)b  beclinigues  (F£M).  Ihe  nosb  oouuion  assunipbions  such  as  linear 
beiiperabure  disbribubion  wibhin  each  eleinenb  and  wibhin  each  biine  Increinenb 
were  ubilized.  The  resulblng  difference  equablons  were  sbudied  againsb  the 
niebliod  of  weighbed  residuals  (MWR)  and  various  finite-difference  techniques 
IFCM)  with  cnpliasis  on  sbabillby,  accuracy  and  nonosclllabion  diaracterlstlcs. 
These  codes  possess  builb-in  algorithms,  such  as,  the  Cubhill-McKee  algorithm 
for  optimization  of  the  bandwidth  of  the  matrix.  A  ladle  of  nolten  tnetal  was 
simulated  by  this  algorithm.  Even  though  it  book  10  iberabions,  a  minimum 
bandwidth  or  26  was  obtained.  The  manual  nunibering  of  nodes  yielded  the 
bandwidth  as  17.  Hiere  is  a  wide  scope  for  further  researdi  and  developnenb 
of  new  and  ittproved  algorithms. 

I .  lNrR0[lX7ri^ .  The  fininte-elenient  method  of  describing  continuous 
systems  was  first  introduced  In  the  mid-1950'8  and  has  since  become  an 
extremely  useful  engineering  technique.  ,  The  FEM  has  been  applied  in  a  variety 
of  fields  including  stress  analysis, . fluid  dynamics,  and  field  theory.  tk>rrie 
and  de-Vris  Cl3llst  over  7,000  reference's  in  a  bibliography  published  in  1976. 
Numerous  references  exist  in  the  case  of  the  finite-difference  method  and  also 
the  method  of  weighted  risiduals*  It  is  nob  the  objective  to  discuss  such  a 
vast  literature.  Ratlier,  it  is  the  int^t  bo  summarize  conclusions  based  on 
tile  author's  research  even  though  it  inay  be  considered  controversial, 
liopefully,  sudi  a  statement  may  stimulate  further  researdi  and  bear  fruitful 
findings  later. 

The  transient  two-dimensional  heat  conductlcxi  was  analyzed  by  FEM  with 
popular  assuiiptlons  sudi  as  linear  tenperature  distribution  within  eadi 
element  and  within  each  time-increment.  ‘  The  resulting  equations  were  coHpared 
against  the  result  of  the  MWR  and  FEM  with  enthasls  on  stability,  accuracy, 
and  nonoscillation  diaracterlstlcs*  The  eff^t  of  relaxation  of  Uiose 
assunptlons  upon  the  conclusions  is  questioned. 

Various  FEM  codes  C  2,  3,  43  are  available  for  analysis  of  engineering 
problems  with  any  geometry,  nonlinear  material  properties,  and  nonlinear 
conditions.  These  codes  possess  algorithms,  such  as  the  Cuthlll-McKee  C  5 1 
algoritiun,  for  optimizatlpn  (inlniinizabion)  of  ttie  bandwidth  of  the  matrix 
(resulting  system  of  equations).  It  is  inportant  to  realize  the  effect  of 
bandwidUi  on  core  storage  and  caiputational  tlm^s,  especially  for  multi¬ 
dimensional  problems.  It  is  deiionsbrabed  that  these  algorithms  do  nob 
r^ecessarily  yield  the  minimum  bandwidth  and  therefore,  manual  nuuiberlng  of 
nodes  may  be  better  if  practicable. 

The  author  of  this  paper  presented  It  at  the  Second  Army  Conference  of  Applied 
Mathematics  and  Computing. 


2.  FINITE  El^MEOT  t^OpOLOGy.  Gurtin  1 6]  introduced  the  variational 
princxpie  for  lineir~initial  value  prdblems  in  1964  and  confirmed  that  the 
funciton  T(x,y,t)  which  leads  to  an  extremum  of  the  functional 


iW)  =•  -  r  |pc,r*r  +  vr*K*VT  -  ‘2pc,ivr\dv 

2  J  V 

-  J^Qm^rds  (1) 

is  the  solution  of  the  transient  heat  conduction  equation 


(.K*T.  t).  t  -  pC,*  ~  -  0  (2) 

with  the  boundary  condition 

K*T.  ,~Q,~0  (3) 

Where  T  (x,y,t)  is  the  tenperature  at  the  spatial  point  (x,y)  and  time  t,Tois 
the  initial  tenperature,  vt  is  the  gradient  of  T,  K  is  the  thermal 
conductivity,  P  is  the  material  density,  Cp  is  the  heat  capacity  of  the 
material  per  unit  mass.  Qi(x,y,t)  »  /Qi(x,y,t)dt  . 

V  is  the  volume  and  *  is  the  convolution  synbol  defined  as 


T*T  -  J*  T{x,  y,  t  -  t)2'(*,  V,  r)* 


dT*  bT  bT*  bT 
vr*vr  -  ~  (4) 

dx  o:c  dy  oy 


The  two-dimensional  body  was  divided  into  square  elements  of  length  AL  and 
linear  tenplerature  distribution  was  assumed  within  each  element.  The 
integration  in  the  functional  equaticm  was  accomplished  and  applied,  the  first 
variation  with  respect  to  the  nodal  temperature,  '^i.j.k+l  ‘  An  additional 

assumption  r^arding  variation  of  nodal  temperature  with  respect  to  time 
within  any  time  increment  is  necessary  in  order  to  evaluate  the  mid-terra  of 
the  functicxial.  If  a  linear  variation  (with  respect  to  time  within  each  time 
step)  is  assumed,  the  finite-element  difference  equation  (FEDE)  can  be  written 
asj 

+  ^^7V/,*+i  -h  7?7*i,yH,*+i  + 

+  ^’7'f_i,/.*  +  DTi-i,j+uk 

ETi,j^uk  +  +  ET -f  DTi^tj-uk  + 

+ 

where 


A  « 


30 


4.8- 

<"-4  +  5* 
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l  +  i 

30  ^  3 


£  -  i  +  ^  fl 
9  3 
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KAt 
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3.  OOMMEWTS.  Even  though,  the  application  of  MWR-Col location  Yields 
finite-difference  equations  by  the  use  of  any  of  the  folloMing  Laplacian  term 
approximations  (v^x  )  ,  this  is  not  the  case  for  FEDE; 


i-1.3  ^  ^1.3-1  \3*i  *  Vi.j 

(AL)* 

2  (AL)» 


L  16  ^ 


(7) 


L  19 


j+i 


6  (4L)* 


L  17  19 


3  (AL)* 


(8) 


If  only  L17  19  is  chosen  as  the  laplacian  approxiinaticxi,  one  can  prove  that 
the  MWR-Collocation  yields  finite  difference  equation  and  the  MWR-Galerkin 
yields  FEIDE.  The  question  now  arises  whether  or  not  LI?  19  is  the  best  one 
because  the  unique  FEDE  can  be  obtained  only  by  L  17  19  where  as  numerous  IWl 
and  FEM  can  be  formulated.  If  accuracy,  stability,  and  nonoscillaticn 
characteristics  are  derived,  the  order  from  best  to  least  suitable  is  as 
follows; 

Accuracy:  1,19  >  L16  *  LlT^  19  ^  L17 

Stability:  L17  >  L1719  >  U9  >  L16 
Nonoscillation:  L17  “  L1719  ^  1,19  >  L16 


Certainly,  the  L17  19  is  not  the  best  one.  Therefore,  FEM  may  not  be  the  best 
as  far  as  accuracy,  stability  and  nonoscillaticn  characteristics  are  concerned 
What  happens  if  nonlinear  variations  are  considered  instead  of  linear  nodal 
tenperature  variations  within  each  element  and  also  within  each  time 
increment?  We  do  know  nonlinear  tenperature  distributions  in  space  were 
considered  in  many  finite-element  codes.  However,  nonlinear  variation  with 
respect  to  time  is  not  considered  to  the  best  of  the  author's  knowledge.  Has 
anyone  derived  FEEX!  with  the  assunption  of  ncxxlinear  distribution  in  space? 

Do  the  conclusion  with  respect  to  accuracy,  stability  and  nonoscillation 
cliaracteristics  remain  valid  for  nonlinear  variations?  Further  research  is 
needed  to  answer  some  of  these  questions. 
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4.  STORAGE  AND  BANEWIDTH.  Everyoie  ends  up  with  a  system  of  linear 
algebraic  equations  irrespective  of  the  use  of  any  technique  whether  it  is 
FDM,  MWR  or  FEM.  In  the  end,  ccxtputers  are  used  in  association  with  some 
technic[ues  for  the  soluticn  of  simultcineous  linear  equaticais.  A  smart  way  of 
resolving  these  equaticMis  is  to  use  a  banded  matrix  solution  technique  which 
has  the  advantages  of  speed  and  of  using  minimum  conputer  core  storage.  This 
inplies  that  one  should  ensure  minimum  matrix  bandwidth.  Indeed,  one  should 
also  take  advantage  of  sparseness  (zeros)  of  a  matrix.  The  following  exanple 
illustrates  the  inportance  of  minimizaticn  of  core  storage: 


Table  1 .  Storage 


#  of  eqns 

Full  matrix 

t^per  Matrix 

Upper  Bainded  Matrix 

Bandwidth 

1 . . . 

100 

10,000 

5,050 

1,045 

11 

500 

250,000 

125,250 

24,225 

51 

1000 

1,000,000 

500,500 

49,725 

51 

95,950  i 

101 

One  may  achieve  the  minimizaticxi  of  matrix  bandwidth  by  numbering  the  nodal 
points  in  a  structure  in  a  particular  manner.  Either  conplex  structure  or 
inadvertent  numbering  yields  an  inefficient  system  of  equations  and  thus  one 
has  to  rely  on  automatic  renumbering  algorithms  such  as  Cuthill-McKee.  The 
following  exanple  illustrates  its  mechanics  and  effectiveness: 

Tbble  2.  Cuthill  -  McKee  Technique 
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5.  DISCUSSICW.  Ihe  importance  of  minimization  of  core  storage  and 
reducticsi  of  matrix  bandwidth  is  demonstrated  in  Table  1.  The  automatic 
generation  of  mesh  and  automatic  numbering  and  renumbering  of  nodes  is  quite 
common  in  most  finite-element  conputer  codes.  Ihe  renumbering  of  nodes  is 
accomplished,  manually,  by  the  Cuthill-Mct<ee  technique  for  a  simple  example. 
Fortunately,  the  niminum  bandwidth  is  obtained  in  this  case.  Scxnetimes,  this 
may  not  be  true.  CJonsider  the  following  example  with  40  quadrilateral  (8- 
node)  elements.  Because  of  symametry,  this  object  may  be  considered  as  one- 
half  of  a  crucible  or  rectangular  channel  filled  partially  by  a  hot  molten 
metal.  There  are  a  total  of  149  nodes.  For  example,  the  Marc  Code  generates 
the  mesh  and  assigns  the  node  numbers  as  shown  for  given  geometry. 


Ihe  autcxnatically  generated  and  numibered  scheme  does  contain  the  bandwidth  of 
65.  However,  if  an  iterative  (say  10  iterations)  Cuthill-McKee  algorithm  is 
called  for  minimizaticxi  of  matrix  bandwidth,  the  following  renumbering  is 
acconplishesd  with  a  bandwidth  of  26. 
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Cuthlll-Mckee  Manual 


The  bandwidth  is  29,  29,  32,  35,  46,  29,  26,  26,  26,  and  26  for  iterations 
1,2, 3, 4, 5, 6, 7, 8, 9,  and  10  respectively.  Certainly,  the  final  bandwidth  is  not 
an  optimum  choice.  The  manual  nunibering  as  shown  above  accomplished  a 
bandwidth  of  17  based  on  intuition.  Thus,  significant  savings  in  both  core 
storage  space  and  conuputaticxis  is  acconplished.  Therefore,  a  prudent 
programmer/analyst,  whereever  possible,  should  not  rely  on  automation  alone. 
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